PCT 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) \ 



(51) International Patent Classification 7 : 

C12N 15/12, C07K 14/47, A61K 38/17, 
G01N 33/53, C12Q 1/68, C12N 15/62, 
C07K 16/18 



A2 



(11) International Publication Number: WO 00/5375$ 

(43) International Publication Date: 1 4 September 2000 ( 1 4.09.00) 



(21) International Application Number: PCT/US00/05841 

(22) International Filing Date: 2 March 2000 (02.03.00) 



(30) Priority Data: 

PCI7US99/05028 
60/123,618 
60/123,957 
60/125,775 
60/128,849 
PCT/US99/08615 
60/131,445 
60/132,371 
60/134,287 
PCT/US99/12252 
. 60/141,037 
60/144,758 
60/145,698 
60/146,222 
PCT/US99/20Ui 
PCT/US99/20594 
PCT/US99/20944 
PCT/US99/21090 
PCT/US99/21547 
PCT/US99/23089 
60/162,506 
PCT/US99/28214 
PCT/US99/28313 
PCT/US99/28409 
PCT/US99/28301 
PCT/US99/28634 
PCT/US99/28551 
PCT/US99/28564 
PCT/US99/28565 
PCT/US99/30095 
PCT/US99/30999 
PCT/US99/31274 
PCT/US(XV00219 
PCT/US0G/00277 
PCT/US0Q/00376 
PCT/US0(y03565 
PCT/US0CVO4341 
PCT/US00/04342 
PCT/USOO/04414 



8 March 1999 (08.03.99) US 

10 March 1999(10.03.99) US 

12 March 1999(12.03.99) US 

23 March 1999 (23.03.99) , US 

12 April 1999 (12.04.99) US 
20 April 1999 (20.04.99) US 
28 April 1999 (28.04.99) US 

4 May 1999 (04.05.99) US 

14 May 1999(14.05.99) US 
2 June 1999(02.06.99) US 
23 June 1999 (23.06.99) US 
20 July 1999 (20.07.99) US 
26 July 1999 (26.07.99) US 

28 July 1999 (28.07.99) US 
1 September 1999 (01.09.99) US 
8 September 1999 (08.09.99) US 

1 3 September 1999 ( 1 3.09.99) US 

1 5 September 1999 ( 1 5.09.99) US 

15 September 1999 (15.09.99) US 

5 October 1999 (05.10.99) US 

29 October 1999 (29.10.99) US 

29 November 1999 (29.1 1 .99) US 

30 November 1999 (30.1 1 .99) US 
30 November 1999 (30.1 1 .99) US 
1 December 1999 (01.12.99) US 

1 December 1999 (01 .12.99) US 

2 December 1999 (02.12.99) US 
2 December 1999 (02.12.99) US 
2 December 1999 (02.12.99) US 

16 December 1999 (16.12.99) US 
20 December 1999 (20.12.99) US 
30 December 1999 (30.12.99) US 

5 January 2000 (05.01 .00) US 

6 January 2000 (06.01 .00) US 
6 January 2000 (06.01 .00) US 
1 1 February 2000 (1 1 .02.00) US 
18 February 2000 (18.02.00) US 
1 8 February 2000 (1 8.02.00) US 
22 February 2000 (22.02.00) US 



(71) Applicant (for ail designated States except US): GENENTECH, 

INC. [US/US]; 1 DNA Way, South San Francisco, CA 
94080-4990 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): ASHKENAZI, Avi, J. 
[US/US]; 1456 Tarrytown Street, San Mateo, CA 94402 
(US). BAKER, Kevin, P. [GB/US]; 14006 Indian Run 
Drive, Darnestown, MD 20878 (US). GODDARD, Audrey 
[CA/USJ; 110 Congo Street, San Francisco, CA 94131 
(US). GURNEY, Austin, L. [US/US]; 1 Debbie Lane, 
Belmont, CA 94002 (US). HEBERT, Caroline [US/US]; 
1809 Vine Street, Berkeley, CA 94703 (US). HENZEL, 
William [US/US]; 3724 Southwood Drive, San Mateo, 
CA 94030 (US). KABAKOFF, Rhona, C. [BR/US]; 1084 
Granada Drive, Pacifica, CA 94044 (US). LU, Yanmei 
[CN/US]; 1001 Continentals Way #206, Belmont, CA 94002 
(US). PAN, James [CA/US]; 2705 Coronet Boulevard, 
Belmont, CA 94002 (US). PENNICA, Diane [US/US]; 
2417 Hale Drive, Burlingame, CA 94010 (US). SHELTON, 
David, L. [US/US]; 5845 Clover Drive, Oakland, CA 
94618 (US). SMITH, Victoria [AU/US]; 19 Dwight Road, 
Buriingame, CA 94010 (US). STEWART, Timothy, A. 
[US/US]; 465 Douglass Street, San Francisco, CA 94114 
(US). TUMAS, Daniel [US/US1; 3 Rae Court, Orinda, 
CA 94563 (US). WATANABE, Colin, K. [US/US]; 128 
Corliss Drive, Moraga, CA 94556 (US). WOOD, William.. 
I. (US/US]; 35 Southdown Court, Hillsborough, CA 94010 
(US). YAN, Minhong [CN/US]; 1910 Garden Drive #114, 
Burlingame, CA 94010 (US). 

(74) Agents: SVOBODA, Craig, G. et al.; Genentech, Inc., 1 DNA 
Way, South San Francisco, CA 94080-4990 (US). 



(81) Designated States: AE, AL, AM, AT, AU, AZ, BA, BB, BG, 
BR, BY, CA, CH, CN, CR, CU, CZ, DE, DK, DM, EE, 
ES, FI, GB, GD, GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, 
KE, KG, KP, KR, KZ, LC, LK, LR, LS, LT, LU, LV, MA, 
MD, MG, MK, MN, MW, MX, NO. NZ, PL, PT, RO, RU, 
SD, SE, SG, SI, SK, SL, TJ r TM, TR, TT, TZ, UA, UG, 
US, UZ, VN, YU, ZA, ZW, ARIPO patent (GH, GM, KE, 
LS, MW, SD, SL, SZ, TZ, UG, ZW), Eurasian patent (AM, 
AZ, BY, KG, KZ, MD, RU, TJ, TM), European patent (AT, 
BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, 
MC, NL, PT, SE), OAPI patent (BF, BJ, CF, CG, a, CM, 
GA, GN, GW, ML, MR, NE, SN, TD, TG). 

Published 

Without international search report and to be republished 
upon receipt of that report 



(54) Tide: COMPOSITIONS AND METHODS FOR THE TREATMENT OF IMMUNE RELATED DISEASES 
(57) Abstract 

The present invention relates to a composition containing novel proteins and methods for the diagnosis and treatment of immune 
related diseases. 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


AM 


Armenia 


Fl 


Finland 


AT 


Austria 


FR 


France 


AU 


Australia 


GA 


Gabon 


AZ 


Azerbaijan 


GB 


United Kingdom 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


BB 


Barbados 


GH 


Ghana 


BE 


Belgium 


GN 


Guinea 


BF 


Burkina Faso 


GR 


Greece 


BG 


Bulgaria 


HU 


Hungary 


BJ 


Benin 


IE 


Ireland 


BR 


Brazil 


IL 


Israel 


BY 


Belarus 


IS 


Iceland 


CA 


Canada 


IT 


Italy 


CF 


Central African Republic 


JP 


Japan 


CG 


Congo 


KE 


Kenya 


CH 


Switzerland 


KG 


Kyrgyzstan 


CI 


Cdte d'lvoire 


KP 


Democratic People's 


CM 


Cameroon 




Republic of Korea 


CN 


China 


KR 


Republic of Korea 


CU 


Cuba 


KZ 


Kazakstan 


CZ 


Czech Republic 


LC 


Saint Lucia 


DE 


Germany 


U 


Liechtenstein 


DK 


Denmark 


LK 


Sri Lanka 


EE 


Estonia 


LR 


Liberia 



LS 


Lesotho 


SI 


Slovenia 


LT 


Lithuania 


SK 


Slovakia 


LU 


Luxembourg 


SN 


Senegal 


LV 


Latvia 


sz 


Swaziland 


MC 


Monaco 


TD 


Chad 


MD 


Republic of Moldova 


TG 


Togo 


MG 


Madagascar 


TJ 


Tajikistan 


MK 


The former Yugoslav 


TM 


Turkmenistan 




Republic of Macedonia 


TR 


Turkey 


ML 


Mali 


TT 


Trinidad and Tobago 


MN 


Mongolia 


UA 


Ukraine 


MR 


Mauritania 


UG 


Uganda 


MW 


Malawi 


US 


United States of America 


MX 


Mexico 


uz 


Uzbekistan 


NE 


Niger 


VN 


Viet Nam 


NL 


Netherlands 


YU 


Yugoslavia 


NO 


Norway 


zw 


Zimbabwe 


NZ 


New Zealand 






PL 


Poland 






PT 


Portugal 






RO 


Romania 






RU 


Russian Federation 






SD 


Sudan 






SE 


Sweden 






SG 


Singapore 







WO 00/53758 



PCT/USQO/05841 



COMPOSITIONS AND METHODS FOR THE TREATMENT OF IMMUNE RELATED DISEASES 



Field of the [nvention 

5 The present invention relates to compositions and methods for the diagnosis and treatment of immune 

related diseases. 

Background of the Invention 
Immune related and inflammatory diseases are the manifestation or consequence of fairly complex, 

10 often multiple interconnected biological pathways which in normal physiology are critical to respond to insult 
or injury, initiate repair from insult or injury, and mount innate and acquired defense against foreign organisms. 
Disease or pathology occurs when these normal physiological pathways cause additional insult or injury either 
as directly related to the intensity of the response, as a consequence of abnormal regulation or excessive 
stimulation, as a reaction to self, or as a combination of these. 

15 Though the genesis of these diseases often involves multistep pathways and often multiple different 

biological systems/pathways, intervention at critical points in one or more of these pathways can have an 
ameliorative or therapeutic effect. Therapeutic intervention can occur by either antagonism of a detrimental 
process/pathway or stimulation of a beneficial process/ pathway. 

Many immune related diseases are known and have been extensively studied. Such diseases include 

20 immune-mediated inflammatory diseases, non- immune-mediated inflammatory diseases, infectious diseases, 
immunodeficiency diseases, neoplasia, etc. 

T lymphocytes (T cells) are an important component of a mammalian immune response. T cells 
recognize antigens which are associated with a self-molecule encoded by genes within the major 
histocompatibility complex (MHC). The antigen may be displayed together with MHC molecules on the 

25 surface of antigen presenting cells, virus infected cells, cancer cells, grafts, etc. The T cell system eliminates 
these altered ceils which pose a health threat to the host mammai. T ceils include helper T celLs and cytotoxic 
T cells. Helper T cells proliferate extensively following recognition of an antigen -MHC complex on an 
antigen presenting cell. Helper T cells also secrete a variety of cytokines. Le., lymphokines. which play a 
central role in the activation of B cells, cytotoxic T cells and a variety of other cells which participate in the 

30 immune response. 

A central event in both humoral and ceil mediated immune responses is the activation and clonal 
expansion of helper T cells. Helper T cell activation is initiated by the interaction of the T cell receptor (TCR) 
- GD3 complex with an antigen-MHC on the surface of an antigen presenting cell. This interaction mediates a 
cascade of biochemical events that induce the resting helper T cell to enter a cell cycle (the GO to Gl 

35 transition) and results in the expression of a high affinity receptor for IL-2 and sometimes IL-4. The activated 
T cell progresses through the cycle proliferating and differentiating into memory cells or effector cells. 

In addition to the signals mediated through the TCR, activation of T cells involves additional 
costimuiation induced by cytokines released by the antigen presenting cell or through interactions with 
membrane bound molecules on the antigen presenting cell and the T cell The cytokines IL-l and IL-6 have 

40 been shown to provide a costimulatory signal. Also, the interaction between the B7 molecule expressed on the 

I 
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surface of an antigen presenting cell and CD28 and CTLA-4 molecules expressed on the T cell surface ettect 1 
cell activation. Activated T cells express an increased number of cellular adhesion molecules, such as ICAM- 
1, integrins, VLA-4, LFA-1, CD56. etc. 

T-cell proliferation in a mixed lymphocyte culture or mixed lymphocyte reaction (MLR) is. an 
5 established indication of the ability of a compound to stimulate the immune system. In many immune 
responses, inflammatory cells infiltrate the site of injury or infection. The migrating cells may be neutrophilic, 
eosinophilic, monocytic or lymphocytic as can be determined by histologic examination of the affected tissues. 
Current Protocols in Immunology, ed. John E. Coligan. 1994. John Wiley & Sons. Inc. 

Immune related diseases can be treated by suppressing the immune response. Using neutralizing 
10 antibodies that inhibit molecules having immune stimulatory activity would be beneficial in the treatment of 
immune-mediated and inflammatory diseases. Molecules which inhibit the immune response can be utilized 
(proteins directly or via the use of antibody agonists) to inhibit the immune response and thus ameliorate 
immune related disease. 

I j Summarv of the Invention 

The present invention concerns compositions and methods tor the diagnosis and treatment of immune 
related disease in mammals, including humans. The present invention is based on the identification of proteins 
(including agonist and antagonist antibodies) which either stimulate or inhibit the immune response in 
mammals. Immune related diseases can be treated by suppressing or enhancing the immune response. 
20 Molecules that enhance the immune response stimulate or potentiate the immune response to an antigen. 
Molecules which stimulate the immune response can be used therapeutically where enhancement of the 
immune response would be beneficial. Such stimulatory molecules can also be inhibited where suppression of 
the immune response would be of value. 

Neutralizing antibodies are examples of molecules that inhibit molecules having immune stimulatory 
25 activity and which would be beneficial in the treatment of immune related and inflammatory diseases. 
Molecuies which inhibit the immune response can also be utilized (proteins directly or via the use of antibody 
agonists) to inhibit the immune response and thus ameliorate immune related disease. 

Accordingly, the PRO polypeptides and anti-PRO antibodies and fragments thereof arc useful for the 
diagnosis and/or treatment (including prevention) of immune related diseases. Antibodies which bind to 
30 stimulatory proteins are useful to suppress the immune system and the immune response. Antibodies which 
bind to inhibitory proteins are useful to stimulate the immune system and the immune response. The PRO 
polypeptides and anti-PRO antibodies also useful to prepare medicines and medicaments for the treatment of 
immune related and inflammatory diseases. 

In one embodiment, the invention provides for isolated nucleic acid molecules comprising nucleotide 

35 sequences that encodes a PRO polypeptide. 

In one aspect, the isolated nucleic acid molecule comprises a nucleotide sequence having at least 
about 80% nucleic acid sequence identity, alternatively at least about 81% nucleic acid identity, alternatively at 
least about 82% nucleic acid sequence identity, alternatively at least about 83% nucleic acid sequence identity, 
alternatively at least about 84% nucleic acid sequence identity, alternatively at least about 85% nucleic acid 

40 sequence identity, alternatively at least about 86% nucleic acid sequence identity, alternatively at least about 



SUBSTITUTE SHEET (RULE 26) 



PCTAJSOO/05841 

WO 00/53758 

87% nucleic acid sequence identity, alternatively at least about 38?'. nucleic acid sequence identity, 
alternatively at least about 89% nucleic acid sequence identity, alternatively at least about 90% nucleic acid 
sequence identity, alternatively at least about 91% nucleic acid sequence identity, alternatively at least about 
92% nucleic acid sequence identity, alternatively at least about 93% nucleic acid sequence identity, 
alternatively at least about 94% nucleic acid sequence identity, alternatively at least about 95% nucleic acid 
sequence identity, alternatively at least about 96% nucleic acid sequence identity, alternatively at least about 
97% nucleic acid sequence identity, alternatively at least about 98% nucleic acid sequence identity and 
alternatively at least about 99% nucleic acid sequence identity to (a) a DNA molecule encoding a PRO 
polypeptide having a full-length amino acid sequence as disclosed herein, an amino acid sequence lacking the 
signal peptide as disclosed herein, an extracellular domain of a transmembrane protein, with or without the 
signal peptide, as disclosed herein or any other specifically defined fragment of the full-length amino acid 
sequence as disclosed herein, or (b) the complement of the DNA molecule of (a). 

In other aspects, the isolated nucleic acid molecule comprises a nucleotide sequence having at least 
about 80% nucleic acid sequence identity, alternatively at least about 81% nucleic acid sequence identity, 
alternatively at least about 82% nucleic acid sequence identity, alternatively at least about 83% nucleic ac.d 
sequence .dentiry. alternatively at least about 84% nucleic acid sequence identity, alternatively at least about 
85% nucleic add sequence identity, alternatively at least about 86% nucleic acid sequence identity, 
alternatively at least about 87% nucleic acid sequence identity, alternatively at least about 88% nucleic acid 
sequence identity, alternatively at least about 89% nucleic acid sequence identity, alternatively at least about 
90% nucleic acid sequence identity, alternately at least about 91% nucleic ac.d sequence identity, 
alternatively at least about 92% nucleic acid sequence identity, alternatively at least about 93% nucleic acid 
sequence identity, alternatively at least about 94% nucleic acid sequence identity, alternatively at least about 
95% nucleic acid sequence identity, alternatively at least about 96% nucleic acid sequence identity, 
alternatively at least about 97% nucleic acid sequence identity, alternatively at least about 98% nucleic acid 
sequence identity, alternatively at least about 99% nucleic acid sequence identity to (a) a DNA molecule 
comprising the coding sequence of a full-length PRO polypeptide cDNA as disclosed herein, the coding 
sequence of a PRO polypeptide lacking the signal peptide as disclosed herein, the coding sequence of an 
extracellular domain of a transmembrane PRO polypeptide, with or without the signal peptide, as disclosed 
herein or the coding sequence of any other specifically defined fragment of the full-length amino acid sequence 
as disclosed herein.' or (b) the complement of the DNA molecule of (a). 

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising a nucleotide 
sequence having at least about 80% nucleic acid sequence identity, alternatively at least about 81% nucleic acid 
sequence identity, alternatively at least about 82% nucleic acid sequence identity, alternatively at least about 
83% nucleic acid sequence identity, alternatively at least about 84% nucleic acid sequence identity, 
alternatively at least about 85% nucleic acid sequence identity, alternatively at least about 86% nucleic acid 
sequence identity, alternatively at least about 87% nucleic acid sequence identity, alternatively at least about 
88% nucleic acid sequence identity, alternatively at least about 89% nucleic acid sequence identity, 
alternatively at least about 90% nucleic acid sequence identity, alternatively at least about 91% nucleic acid 
sequence identity, alternatively at least about 92% nucleic acid sequence identity, alternatively at least about 
93% nucleic acid sequence identity, alternatively at least about 94% nucleic acid sequence identity, 
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alternatively at least about 95% nucleic acid sequence identity, alternatively at least about 96% nucleic acid 
sequence identity, alternatively at least about 97% nucleic acid sequence identity, alternatively at least about 
98% nucleic acid sequence identity, alternatively at least about 99% nucleic acid sequence identity to (a) a 
DNA molecule that encodes the same mature polypeptide encoded by any of the human protein cDNAs 
deposited with the ATCC as disclosed herein, or (b) the complement of the DNA molecule of (a). 

In another aspect, the invention provides for isolated nucleic acid molecule comprising a nucleotide 
sequence encoding a PRO polypeptide with is either transmembrane domain-deleted or transmembrane 
domain-inactivated, or is complementary to such encoding nucleotide sequence, wherein the transmembrane 
domain(s) of such polypeptides are disclosed herein. Therefore, soluble extracellular domains of the herein 
described PRO polypeptides are contemplated. 

Another embodiment is directed to fragments of a PRO polypeptide coding sequence, or the 
complement thereof, that may find use as. for example, hybridization probes, for encoding fragments of a PRO 
polypeptide that may optionally encode a polypeptide comprising a binding site for an anti-PRO polypeptide 
antibody or as antisense oligonucleotide probes. Such nucleic acid fragments are usually at least about 20 
nucleotides in length, alternatively a. least about 30 nucleotides in length, alternatively at least about 40 
nuclcot.de* in lencth. altemat.vclv a. least about 50 nucleotides in length, altemat.vely at least about 60 
nucleotides in length, altemat.vely a, least about 70 nucleotides in length, altemat.vely at least about 80 
nucleotides in length, alternatively at least about 90 nucleotides in length, alternatively at least about 100 
nucleotides in length, alternatively at least about 1 10 nucleotides in length, alternatively a. least about 120 
nucleotides in length, alternatively at least about 130 nucleotides in length, alternatively at least about 140 
nucleoudes in length, altemat.vely a. least about 150 nucleotides in length, alternatively at least about 160 
nucleotides in length, altemat.vely a. least about 170 nucleotides in length, alternatively at least, about 1 80 
nucleotides in length, alternatively at least about 190 nucleotides in length, alternatively at least about 200 
nucleotides in length, alternatively at leas, about 250 nucleotides in length, alternatively at leas, about 300 
nucleotides in length, alternatively at least about 350 nucleotides in length, alternatively a. least about 400 
nucleotides in length, altemat.vely at least about 450 nucleotides in length, alternately at least about 500 
nucleotides in length, alternatively at leas, about 600 nucleotides in length, alternatively at least about 700 
nucleotides in length, alternatively at least about 800 nucleotides in length, alternatively at least about 900 
nucleotides in length, alternatively at leas, about 1000 nucleotides in length, alternatively at least about 1500 
nucleotide in length, alternatively at least about 2000 nucleotides in length, alternatively at least about 2500 
nucleotide in length, alternatively at least about 3000 nucleotide in length, alternatively at least about 4000 
nucleotide in length, alternatively at least about 5000 nucleotides in length, or more, wherein in this context the 
term "about" means the referenced nucleotide sequence length plus or minus 10% of that referenced length. It 
is noted that novel fragments of a nucleotide sequence encoding the respective PRO polypeptide may be 
determined in a routine manner by aligning the respective nucleotide encoding a PRO polypeptide with other 
known nucleotide sequences using any of a number of well known sequence alignment programs and 
determining which nucleotide sequence fragments) are novel. All such nucleotide sequences encoding the 
respective PRO polypeptides are contemplated herein. Also contemplated are the nucleotide molecules which 
encode fragments of the PRO polypeptides, preferably those polypeptide fragments that comprise a binding site 
for an anti-PRO polypeptide antibody. 



4 
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In another embodiment, the invention provides isolated PRO polypeptides encoded by any of the 
isolated nucleic acid sequences hereinabove identified. 

In a certain aspect, the invention concerns an isolated PRO polypeptide, comprising an amino acid 
sequence having at least about 80% amino acid sequence identity, alternatively at least about 81% amino acid 
sequence identity, alternatively at least about 82% amino acid sequence identity, alternatively at least about 
83% amino acid sequence identity, alternatively at least about 84% amino acid sequence identity, alternatively 
at least about 85% ammo acid sequence identity, alternatively at least about 86% amino acid sequence identity, 
alternatively at least about 87% amino acid sequence identity, alternatively at least about 88% amino acid 
sequence identity, alternatively at least about 89% amino acid sequence identity, alternatively at least about 
90% amino acid sequence identity, alternatively at least about 91% amino acid sequence identity, alternatively 
at least about 92% amino acid sequence identity alternatively at least about 93% amino acid sequence identity, 
alternatively at least about 94% ammo acid sequence identity, alternatively at least about 95% ammo acid 
sequence identity, alternatively at least about 96% ammo acid sequence identity, alternatively at least about 
97% ammo acid sequence identity, alternatively at least about 98% amino acid sequence identity, alternatively 
at least about 99% amino acid sequence identity to a PRO polypeptide having a full-length amino acid 
sequence as disclosed herein, an ammo acid sequence lacking the signal peptide as disclosed herein, an 
extracellular domain of a transmembrane protein, with or without the signal peptide, as disclosed herein or any 
other specifically defined fragment of the full-length amino acid sequence as disclosed herein. 

In a further aspect, the invention concerns an isolated PRO polypeptide comprising an amino acid 
sequence having at least about 80% amino acid sequence identity, alternatively at least about 81% ammo acid 
sequence identity, alternatively at least about 82% ammo acid sequence identity, alternatively at least about 
83% amino acid sequence identity, alternatively at least about 84% ammo acid sequence identity, alternatively 
at least about 85% ammo acid sequence identity, alternatively at least about 86% amino acid sequence identity, 
alternatively at least about 87% ammo acid sequence identity, alternatively at least about 88% ammo acid 
sequence identity, alternatively at least abour 89% ammo acid sequence identity, alternatively at least aboui 
90% ammo acid sequence identity, alternatively at least about 91% amino acid sequence identity, alternatively 
at least about 92% amino acid sequence identity, alternatively at least about 93% ammo acid sequence identity, 
alternatively at least about 94% amino acid sequence identity, alternatively at least about 95% amino acid 
sequence identity, alternatively at least about 96% amino acid sequence identity, alternatively at least about 
97% amino acid sequence identity, alternatively at least about 98% amino acid sequence identity, alternatively 
at least about 99% amino acid sequence identity to an amino acid sequence encoded by any of the human 
protein cDNAs deposited with the ATCC as disclosed herein. 

In a further aspect, the invention concerns an isolated PRO polypeptide comprising an amino acid 
sequence scoring at least about 80% positives, alternatively at least about 81% positives, alternatively at least 
about 82% positives, alternatively at least about 83% positives, alternatively at least about 84% positives, 
alternatively at least about 85% positives, alternatively at least about 86% positives, alternatively at least about 
87% positives, alternatively at least about 88% positives, alternatively at least about 89% positives, 
alternatively at least about 90% positives, alternatively at least about 91% positives, alternatively at least about 
92% positives, alternatively at least about 93% positives, alternatively at least about 94% positives, 
alternatively at least about 95% positives, alternatively at least about 96% positives, alternatively at least about 
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970/. positives, alternatively at least about 98% posttives. alternatively at least about 99% pos.tives when 
compared wtth the amino acid sequence of a PRO polypeptide havtng a full-length amino actd sequence as 
disclosed herein, an ammo acid sequence lacking the signal peptide as disclosed herem. an extracellular 
domain of a transmembrane protein, with.or without the signal pepttde. as disclosed herem or any other 
specifically defined fragment of the full-length amino acid sequence as disclosed herein. 

In a specific aspect, the invention provtdes an isolated PRO polypeptide without the N-.erminal stgnal 
sequence and'or the initiating methionine and .s encoded by a nucleotide sequence that encodes such an ammo 
add sequence as hereinbefore described. Processes for produc.ng the same are a.so herein desenbed. where* 
those processes comprise culturing a host cell comprising a vector which comprises the appropriate encodmg 
nucle.c ac.d molecule under conditions su.table for express.on of the PRO polypept.de and recovenng the same 
from the ceil culture. 

In another aspect, the invention provides an .solated PRO polypeptide which is e.ther transmembrane- 
deleted or transmembrane domain-.nactivated. Processes for producing the same are also herein desenbed. 
wherein those processes comprise eultunne a host cell compns.ng a vector which comprises the approbate 
,-ncoding nuclei acid molecule under condition, suttable for expression of the PRO polypept.de and 
recoverinn the PRO polypeptide from the cell culture. 

In another embodiment, the invent.on provides vectors compns.ng DNA encod.ng any of the PRO 
po.ypept.de., Host ceils compns.ng any such vector are also provided. By way of example, the host cells may 
h, CHO cells. E. coli or yeast. A process for producing any of the herein described polypeptides .s further 
provided and composes culturing host cells under conditions su.table for expression of the des.red 
polypeptides and recovering the desired polypeptide from the cell culture. 

In other embodiments, the invention provides chimenc molecules compris.ng any of the herem 
described polypcpudes fused to a heterologous polypeptide or ammo acid sequence. Examples of such 
chimeric molecules compnse any of the herein desenbed polypeptides fused .0 an epitope tag sequence or a Fc 

region of an immunoglobulin. 

I„ yet other embod.mems. the invention prov.des oligonucleotide probes useful for isolat.ng genomic 
and cDNA nucleot.de sequences or as amisense probes, wherem those probes may be derived from any of the 
above or below described nucleotide sequences. 

In yet another embodiment, the uwent.on concerns agonists and antagonists of the PRO poiypept.des. 
lh at m.m.c or inhibit one or more functions or activities of the PRO poiypept.des. In a particular embodunent, 
the agonist or antagonist is an antibody that binds to the PRO polypeptides or a small molecule. 

In another embodiment, the invention provides an antibody which specifically binds to any of the 
above or below desenbed polypeptides. Optionally, the antibody is a monoclonal antibody, humanized 
antibody, antibody fragment or single-chain antibody. In one aspect, the present invention concerns an .solatc 
antibodv which binds a PRO polypeptide. In another aspect, the antibody mimics the activtty of a PR 
polypeptide (an agonist antibody) or conversely the antibody inhibits or neutralizes the activry of a PR 
polypeptide (an antagonist antibody). In another aspect, the antibody is a monoclonal antibody, which 
preferably has nonhuman complementarity determining region (CDR) residues and human framework region 
(FR) residues. The antibody may be labeled and may be immobilized on a solid support In a further aspect, 
the antibody is an antibody fragment, a monoclonal antibody/a single-chain antibody, or an anti-tdiotyp.c 
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antibody. 

In a further embodiment, the invention concerns a method of identifying agonists or antagonists to a 
PRO polypeptide which comprises contacting the PRO polypeptide with a candidate molecule and monitoring 
a biological activiry mediated by said PRO polypeptide. Preferably, the PRO polypeptide is a native sequence 
5 PRO polypeptide. 

In another embodiment, the invention concerns a composition of matter containing PRO polypeptide 
or an agonist or antagonist antibody which binds the polypeptide in admixrure with a carrier or excipient. In 
one aspect, the composition contains a therapeutically effective amount of the peptide or antibody. In another 
aspect, when the composition contains an immune stimulating molecule, the composition is useful for: (a) 

10 increasing infiltration of inflammatory cells into a tissue of a mammal in need thereof, (b) stimulating or 
enhancing an immune response in a mammal in need thereof, or (c) increasing the proliferation of T- 
lymphocytes in a mammal in need thereof in response to an antigen. In a further aspect, when the composition 
contains an immune inhibiting molecule, the composition is useful for: (a) decreasing infiltration of 
inflammatory cells into a tissue of a mammal in need thereof, (b) inhibiting or reducing an immune response 

15 in a mammal in need thereof, or (c) decreasing the proliferation of T-lymphocytes in a mammal in need thereof 
in response to an antigen. In another aspect, the composition contains a further active ingredient, which may, 
for example, be a further antibody or a cytotoxic or chemotherapeutic agent. Preferably, the composition is 
sterile. 

In another embodiment, the invention concerns the use of the polypeptides and antibodies of the 

20 invention to prepare a composition or medicament which has the uses described above. 

In a further embodiment, the invention concerns nucleic acid encoding an anti-PRO200, anti-PRO204, 
anti-PR0212. anti-PR0216, anti-PR0226, anti-PRO240, anti-PR0235. anti-PR0245. anti-PROI72, anti- 
PR0273. ana-PR0272. anti-PR0332. anti-PR0526, anti-PRO70I. anti-PR0361. anti-PR0362. anti-PR0363, 
anti-PR0364. anti-PR0356. anti-PR0531, anti-PR0533. anti-PRO1083. anti-PR0865. anti-PRO770. anti- 

25 PR0769. anti-PR0788. anti-PROl 1 14, anti-PROl 007, anti-PROl 184, anti-PROl031. anti-PR01346, anti- 
PR01I55. anti-PROl250. anti-PROI3 12, anti-PROl 192. anti-PR01246. anti-PR012S3. anti-PROU95. anti- 
PROI343. anti-PR0l418. anti-PROl 387. anti-PRO!410. anti-PR019l7. anti-PR01868. anti-PRO205. anti- 
PR021. anti-PR0269. ami-PR0344, anti-PR0333, anti-PR0381. anti-PRO720, anti-PR0866, anti-PRO840, 
ami-PR0982. anti-PR0836. anti-PROl 159. anu-PROI358, anti-PRO!325, anti-PR01338. anti-PR01434, 

30 anti-PR04333, anti-PRO4302. anti-PRO4430 or anti-PR05727 antibody, and vectors and recombinant host 
cells comprising such nucleic acid. In a still further embodiment, the invention concerns a method for 
producing such an antibody by culruring a host cell transformed with nucleic acid encoding the antibody under 
conditions such that the antibody is expressed, and recovering the antibody from the cell culture. 

In a further embodiment, the invention concerns an isolated nucleic acid molecule that hybridizes to 

35 the a nucleic acid molecule encoding a PRO polypeptide, or the complement thereof. The nucleic acid 
preferably is DNA. and hybridization preferably occurs under stringent conditions. Such nucleic acid 
molecules can act as antisense molecules of the amplified genes identified herein, which, in turn, can find use 
in the modulation of the respective amplified genes, or as antisense primers in amplification reactions. 
Furthermore, such sequences can be used as part of ribozyme and/or triple helix sequence which, in turn, may 

40 be used in regulation of the amplified genes. 
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In another embodiment, the invention concerns a method for determining the presence of a PRO 
polypeptide comprising exposing a ceil suspected of containing and/or expressing the polypeptide to an anti- 
PRO200, anti-PRO204, anti-PR0212. anti-PR0216. anti-PR0226, anti-PRO240, anti-PR0235, anti-PR0245, 
anti-PRO 172, anti-PR0273, anti-PR0272, anti-PR0332. anti*PR0526 ? anti-PRO701, ami-PR0361, anti- 
5 PR0362, anti-PR0363, anti-PR0364, anti-PR0356, anti-PR053 1, anti-PR0533, anti-PRO1083. anti-PR0865, 
anti-PRO770, anti-PR0769, anti-PR0788. anti-PROll 14. anti-PRO1007, anti-PROl I84 ? anti-PRO1031. anti- 
PRO^, anti-PROl 155. anti-PRO1250. anti-PR01312, anti-PROl 192. anu-PROl246, anti-PR01283, anti- 
PROl 195, anti-PR01343. anti-PRO!418, anti-PR0l387, anti-PRO 1 4 1 0, anti-PROI917, ami-PROl868, ami- 
PRO205. anti-PR021. anti-PR0269. anti-PR0344. ami-PR0333. anti-PR0381. anti-PRO720. anti-PR0866, 

10 anti-PRO840, ami-PR0982. anti-PR0836. anti-PROl 159, anti-PRO!358, anti-PR01325. anti-PR01338. anti- 
PRO 1434, anti-PR04333. anti-PRO4302. ami-PRO4430 or anti-PR05727 antibody and determining binding 
of the antibody to the ceil. 

In yet another embodiment, the present invention concerns a method of diagnosing an immune related 
disease in a mammal, comprising detecting the level of expression of a gene encoding a PRO polypeptide (a) in 

15 a test sample of tissue cells obtained from the mammal, and (b) in a control sample of known normal tissue 
ceils of the same cell type, wherein a higher or lower expression level in the test sample as compared to the 
control sample indicates the presence ol' immune related disease in the mammal from which the test tissue ceils 
were obtained. 

In another embodiment, the present invention concerns a method of diagnosing an immune disease in 

20 a mammal, comprising (a) contacting an anti-PRO polypeptide antibody with a test sample of tissue ceils 
obtained from the mammal, and (bj detecting the formation of a complex between the antibody and the 
respective PRO polypeptide, respectively, in the test sample; wherein the formation of said complex is 
indicative of the presence or absence of said disease. "The detection may be qualitative or quantitative, and may 
be performed in comparison with monitoring the complex formation in a control sample of known normal 

25 tissue cells of the same cell type. A larger quantity of complexes formed in the test sample indicates the 
presence or absence of an immune disease in the mammal from which the test tissue cells were obtained. The 
antibody preferably carries a detectable label. Complex formation can be monitored, for example, by light 
microscopy, flow cytometry, fluorimetry. or other techniques known in the art. The test sample is usually 
obtained from an individual suspected of having a deficiency or abnormality of the immune system. 

30 In another embodiment, the present invention concerns a diagnostic kit. containing an anti-PRO200. 

anti-PRO204, anti-PR0212, anti-PR0216, anti-PR0226. anti-PRO240, anti-PR0235, anti-PR0245, anti- 
PR0172, anti-PR0273, anti-PR0272. anti-PR0332. anti-PR0526, anti-PRO701. anti-PR036I. anti-PR0362, 
anti-PR0363. anti-PR0364, anti-PR0356. anti-PR0531. anti-PR0533, ami-PRO1083, anti-PR0865, ami- 
PRO770. anti-PR0769, anti-PR0788, anti-PROl 1 14, anti-PRO1007, anti-PROl 184, anti-PRO1031, and- 

35 PR01346. anti-PROl 155, anti-PRO1250, anti-PR013 12. anti-PROl 192, anti-PROI246, anti-PRO!283, anti- 
PROl 195, anti-PRO 1343, ami-PR01418, anti-PR01387 t anti-PRO14l0, anti-PR01917. anti-PROI868, anti- 
PRO205, anti-PR021, anti-PR0269, anti-PR0344, anti-PR0333, anti-PR0381, anti-PRO720, anti-PR0866, 
anti-PRO840 f anti-PR0982, anti-PR0836, anti-PROl 159, anti-PRO!358, anti-PR01325 r anti-PR01338, anti- 
PR01434, anti-PR04333, anri-PRO4302, anti-PRO4430 or anti-PR05727 antibody and a carrier (e,g.. a 

40 buffer) in suitable packaging. The kit preferably contains instructions for using the antibody to detect the PRO 
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polypeptide. 

In a further embodiment, the invention concerns an article of manufacture, comprising: 
a container, 

an instruction on the container, and 

a composition comprising an active agent contained within the container, wherein the composition is 
effective for stimulating or inhibiting an immune response in a mammal, the instruction on the container 
indicates that the composition can be used to treat an immune related disease, and the active agent in the 
composition is an agent stimulating or inhibiting the expression and/or activity of the PRO polypeptide. In a 
preferred aspect, the active agent is a PRO200. PRO204. PR0212. PR0216. PR0226. PRO240. PR0235. 
PR0245. PR0172. PR0273, PR0272, PR0332, PR0526. PRO701, PR0361. PR0362. PR0363. PR0364, 
PR0356, PR0531, PR0533, PRO1083. PR0865. PRO770, PR0769, PR0788. PROI114. PRO1007, 
PR01184. PRO1031. PR01346. PR01I55, PROI250, PR01312, PR01192. PR01246. PR01283. PR01195. 
PR01343. PR01418, PR01387, PRO1410. PR019I7, PR01868, PRO205. PR021, PR0269, PR0344, 
PR0333. PR0381. PRO720. PR0866. PRO840. PR0982. PR0836, PR01159. PR01358. PROI325. 
PR01338. PR01434. PR04333. PRO4302. PRO4430 or PR05727 polypeptide or an anti-PRO200. anu- 
PRO204. anii-PR0212. ami-PR0216. anti-PR0226. anu-PRO240. anti-PR0235. anti-PR0245, anu-PROI72. 
anti-PR0273. anti-PR0272. anti-PR0332. anti-PR0526. anti-PRO701. anu-PR0361. anti-PR0362. anti- 
PR0363. anti-PR0364. ami-PR0356, anti-PR0531 . anti-PR0533, anti-PROl 083. anti-PR0865. ami-PRO770. 
anti-PR0769. anu-PR0788. anti-PROl 1 14. anti-PROI007. anti-PROl 184, anti-PRO103 1. anti-PR01346, 
anti-PROit55. ami-PROi250. ami-PROi3i2, anti-PROl 192, ami-PROl246. anti-PRO!283, anti-PROl 195, 
anti-PR01343, anti-PROl4l8, anti-PR01387. anti-PRO1410. anti-PROl917. anti-PR01868, anu-PRO205, 
anti-PR02I. anti-PR0269. anti-PR0344, anti-PR0333. anti-PR0381. anti-PRO720, anti-PR0866, anti- 
PRO840. anu-PR0982. anti-PR0836, anti-PROl 159, anti-PR0l358. anti-PR01325. anti-PR0l338, anti- 
PR01434. anti-PR04333. anti-PRO4302. anti-PRO4430 or anti-PR05727 antibody. 

A further embodiment is a method for identifying a compound capable of inhibiting the expression 
and/or activity of a PRO polypeptide by contacting a candidate compound with a PRO polypeptide under 
conditions and for a time sufficient to allow these two components to interact. In a specific aspect, either the 
candidate compound or the PRO polypeptide is immobilized on a solid support. In another aspect the non- 
immobilized component carries a detectable label. 

Another embodiment of the present invention is directed to the use of a PRO polypeptide, or an 
agonist or antagonist thereof as hereinbefore described, or an and- PRO antibody, for the preparation of a 
medicament useful in the treatment of a condition which is responsive to the PRO polypeptide, an agonist or 
antagonist thereof or an anti-PRO antibody. 

Brief Description of the Drawings 
Figure I shows DNA29101-1276 (SEQ ID NO:l). 

Figure 2 shows the native sequence PRO200 polypeptide UNQI74 (SEQ ID NO:2). 
Figure 3 shows DNA30871-1 157 (SEQ ID NO:l 1). 

Figure 4 shows the native sequence partial length PRO204 polypeptide LJNQ178 (SEQ ID NO: 12) . 
Figure 5 shows DNA30942- 1 1 34 (SEQ ID NO: 13). 

9 
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Figure 6 shows the native sequence PR02I2 polypeptide UNQI86 (SEQ ID NO: 14). 
Figure 7 shows DNA33087-1 158 (SEQ ID NO: 18). 

Figure 8 shows the native sequence PR0216 polypeptide UNQI90 (SEQ ID NO: 19). 
Figure 9 shows DNA33460-1 166 (SEQ ID NO:20). 
5 Figure 10 shows the native sequence PR0226 polypeptide UNQ200 (SEQ ID NO:21). 

Figure 1 1 shows DNA34387-1 138 (SEQ ID NO:25). 

Figure 12 shows the native sequence PRO240 polypeptide UNQ214 (SEQ ID NO:26). 
Figure 1 3 shows DN A35558- 1 1 67 (SEQ ID NO:30). 

Figure 14 shows the native sequence PR0235 polypeptide LTNQ209 (SEQ ID NO:31). 
10 Figure 15 shows DNA35638-1 141 (SEQ ID NO:35). 

Figure 16 shows the native sequence PR0245 polypeptide UNQ219 (SEQ ID NO:36). 
Figure 17 shows DNA35916-1 161 (SEQ ID NO:40). 

Figure IS shows the native sequence PR0172 polypeptide UNQI46 (SEQ ID NO:41). 
Figure 1 9 shows DNA39523- 1 192 (SEQ ID NO:45). 
1 5 Figure 20 shows the native sequence PR0273 polypeptide L/NQ240 (SEQ ID NO:46). 

Figure 21 shows DNA40620-1 183 (SEQ ID NO:50). 

Figure 22 shows the native sequence PR0272 polypeptide UNQ239 (SEQ ID NO:5l). 
Figure 23 shows DNA40982-1235 (SEQ ID NO:56). 

Figure 24 shows the native sequence PR0332 polypeptide UNQ293 (SEQ ID NO:57). 
20 Figure 25 shows DNA44184-1319 (SEQ IDNO:6l). 

Figure 26 shows the native sequence PR0526 polypeptide UNQ330 (SEQ ID NO:62). 
Figure 27 shows DNA44 205- 1285 (SEQ ID NO:66). 

Figure 28 shows the native sequence PRO701 polypeptide UNQ365 (SEQ ID NO:67). 
Figure 29 shows DNA45410-1250 (SEQ ID NO:7I). 
25 Figure 30 shows the native sequence PR036 1 polypeptide UNQ3 1 6 (SEQ ID NO:72). 

Figure 31 shows DNA45416-1251 (SEQ ID NO:79). 

Figure 32 shows the native sequence PR0362 polypeptide UNQ317 (SEQ ID NO:80). 
Figure 33 shows DNA45419-1252 (SEQ ID NO:86). 

Figure 34 shows the native sequence PR0363 polypeptide UNQ318 (SEQ ID NO:87). 
30 Figure 35 shows DNA47365-1206 (SEQ ID NO:9I). 

Figure 36 shows the native sequence PR0364 polypeptide UNQ319 (SEQ ID NO:92). 
Figure 37 shows DNA47470-1 130 (SEQ ID NO: 101). 

Figure 38 shows the native sequence PR0356 polypeptide UNQ3 13 (SEQ ID NO: 102). 
Figure 39 shows DNA483 14-1320 (SEQ ID NO: 106). 
35 Figure 40 shows the native sequence PR053 1 polypeptide UNQ332 (SEQ ID NO: 107). 

Figure41 shows DNA49435-1219 (SEQ ID NO:l 11). 

Figure 42 shows the native sequence PR0533 polypeptide UNQ334 (SEQ ID NO: 1 12). 
Figure 43 shows DNA5092M458 (SEQ ID NO:116). 

Figure 44 shows the native sequence PRO 1083 polypeptide UNQ540 (SEQ ID NO: 117). 
40 Figure 45 shows DNA53974-1401 (SEQ ID NO: 123). 
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Figure 46 shows the native sequence PR0865 polypeptide UNQ434 (SEQ ID NO: 124). 
Figure 47 shows DNA54228- 1366 (SEQ ID NO: 133). 

Figure 48 shows the native sequence PRO770 polypeptide UNQ408 (SEQ ID NO: 134). 
Figure 49 shows DNA5423 1-1366 (SEQ ID NO: 139). 
5 Figure 50 shows the native sequence PR0769 polypeptide UNQ407 (SEQ ID NO: 140). 

Figure 51 shows DNA56405-1357 (SEQ ID NO: 141). 

Figure 52 shows the native sequence PR0788 polypeptide UNQ430 (SEQ ID NO: 142). 
Figure 53 shows DNA57033-1403XSEQ ID NO: 143). 

Figure 54 shows the native sequence PROl 1 14 polypeptide UNQ557 (SEQ ID NO: 144). 
10 Figure 55 shows DNA57690-1374 (SEQ ID NO: 1 45). 

Figure 56 shows the native sequence PRO 1007 polypeptide UNQ49! (SEQ ID NO: 146). 
Figure 57 shows DNA59220-1514 (SEQ ID NO:147). 

Figure 58 shows the native sequence PRO 1 184 polypeptide UNQ598 (SEQ ID NO: 148). 
Figure 59 shows DNA59294-138I (SEQ ID NO: 149). 
15 Figure 60 shows the native sequence PRO 1 031 polypeptide UNQ5I6 (SEQ ID NO: 150). 

Figure 6 1 shows DNA59776- 1 600 (SEQ ID NO: 151). 

Figure 62 shows the native sequence PRO 1 346 polypeptide (JNQ701 (SEQ ID NO: 152). 
Figure 63 shows DNA59849-1504 (SEQ ID NO: 1 56). 

Figure 64 shows the native sequence PROl 155 polypeptide UNQ585 (SEQ ID NO: 157). 
20 Figure 65 shows DNA60775-1532 (SEQ ID NO: 158). 

Figure 66 shows the native sequence PRO 1250 polypeptide UNQ633 (SEQ ID NO: 159). 
Figure 67 shows DNA61873-1574 (SEQ ID NO:160). 

Figure 68 shows the native sequence PRO 13 12 polypeptide UNQ678 (SEQ ID NO: 161). 
Figure 69 shows DNA62814-152I (SEQ ID NO: 162). 
25 Figure 70 shows the native sequence PROl 192 polypeptide UNQ606 (SEQ ID NO: 163). 

Figure 71 shows DNA64885-I529 (SEQ ID NO: 167). 

Figure 72 shows the native sequence PROI246 polypeptide UNQ630(SEQ ID N0:168). 
Figure 73 shows DNA65404- 1 55 1 (SEQ ID NO: 1 69). 

Figure 74 shows the native sequence PRO 1 283 polypeptide UNQ653 (SEQ ID NO: 1 70). 
30 Figure 75 shows DNA654 1 2- 1 523 (SEQ ID NO: 1 77). 

Figure 76 shows the native sequence PROl 195 polypeptide UNQ608 (SEQ ID NO: 178). 
Figure 77 shows DNA66675-1587 (SEQ ID NO: 1 79). 

Figure 78 shows the native sequence PRO 1343 polypeptide UNQ698 (SEQ ID NO: 180). 
Figure 79 shows DNA68864-1629 (SEQ ID NO: 184). 
35 Figure 80 shows the native sequence PROM 18 polypeptide UNQ732 (SEQ ID NO: 185). 

Figure 81 shows DNA68872-1620 (SEQ ID NO: 186). 

Figure 82 shows the native sequence PR01387 polypeptide UNQ722 (SEQ ID NO:187). 
Figure 83 shows DNA68874-1622 (SEQ ID NO: 188). 

Figure 84 shows the native sequence PROI410 polypeptide UNQ728 (SEQ ID NO: 189). 
40 Figure 85 shows DNA76400-2528 (SEQ ID NO: 1 90). 
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Figure 86 shows the native sequence PRO 191 7 polypeptide UNQ900 (SEQ ID NO: 191). 
Figure 87 shows DNA77624-2515 (SEQ ID NO: 192). 

Figure 88 shows the native sequence PR01868 polypeptide UNQ859 (SEQ ID NO: 193). 
Figure 89 shows DNA30868- 1 156 (SEQ ID NO:228). 
5 Figure 90 shows the partial native sequence PRO205 polypeptide UNO 179 (SEQ ID NO:229). 

Figure 91 shows DNA36638-1056 (SEQ ID NO:230). 

Figure 92 shows the native sequence PR02I polypeptide UNQ2I (SEQ ID NO:23I). 
Figure 93 shows DNA38260-M80 (SEQ ID NO:232). 

Figure 94 shows "the native sequence PR0269 polypeptide UNQ236 (SEQ ID NO:233). 
10 . Figure 95 shows DNA40592-I242 (SEQ ID NO:240). 

Figure 96 shows the native sequence PR0344 polypeptide UNQ303 (SEQ ID NO:24 1 ). 
Figure 97 shows DNA4I374-13I2 (SEQ ID NO:248). 

Figure 98 shows the partial length native sequence PR0333 polypeptide UNQ294 (SEQ ID NO:249). 
Figure 99 shows DNA44 1 94- 1 3 1 7 (SEQ ID NO:250). 
!5 Figure 100 shows (he native sequence PR0381 polypeptide UNQ322 (SEQ ID NO:251). 

Figure 101 shows DNA53517-1366 (SEQ ID NO:255). 

Figure 102 shows the native sequence PRO720 polypeptide UNQ388 (SEQ ID NO:256). 
Figure 103 shows DNA5397M 359 (SEQ ID NO:257). 

Figure 104 shows the native sequence PR0866 polypeptide UNQ435 (SEQ ID NO:258). 
20 Figure 105 shows DNA53987- 1438 (SEQ ID NO:266). 

Figure 106 shows the native sequence PRO840 polypeptide LFNQ433 (SEQ ID NO:267). 

Figure 107 shows DNA57700-1408 (SEQ ID NO:268). 
' Figure 108 shows the native sequence PR0982 polypeptide UNQ483 (SEQ ID NO:269). 

Figure 109 shows DNA59620-1463 (SEQ ID NO:270). L 
25 Figure 1 10 shows the native sequence PR0836 polypeptide UNQ545 (SEQ IDNO:27I). 

Figure 1 1 1 shows DNA60627- 1 508 (SEQ ID NO:272). 

Figure 1 12 shows the native sequence PRO II 59 polypeptide UNQ589 (SEQ ID NO:273). 
Figure 1 13 shows DNA64890- 16 1 2 (SEQ ID NO:274). 

Figure 1 14 shows the native sequence PR01358 polypeptide UNQ707 (SEQ ID NO:275). 
30 Figure 1 15 shows DNA66659- 1593 (SEQ ID NO:276). 

Figure 1 16 shows the native sequence PRO 1325 polypeptide UNQ685 (SEQ ID NO:277). 
Figure i 17 shows DNA66667- 1 596 (SEQ ID NO:278). 

Figure 1 18 shows the native sequence PR01338 polypeptide UNQ693 (SEQ ID NO:279). 
Figure 1 19 shows DNA688 18-2536 (SEQ ID NO:280). 
35 Figure 120 shows the native sequence PR01434 polypeptide UNQ739 (SEQ ID NO:28 1). 

Figure 121 shows DNA842 10-2576 (SEQ ID NO:285). 

Figure 122 shows the native sequence PR04333 polypeptide UNQ1888 (SEQ ID NO:286). 
Figure 123 shows DNA922 18-2554 (SEQ ID NO:292). 

Figure 124 shows the native sequence PRO4302 polypeptide UNQI866 (SEQ ID NO:293). 
40 Figure 125 shows DNA96878-2626 (SEQ ID NO:294). 
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Figure 126 shows the native sequence PRO4430 polypeptide UNQI947 (SEQ ID NO:295). 
Figure 127 shows DNA98853-1739 (SEQ ID NO:296). 

Figure 128 shows the native sequence PR05727 polypeptide UNQ2448 (SEQ ID NO:297). 
Detailed Description of the Preferred Embodiments 

I. Definitions 

The terms "PRO polypeptide(s)" and "PRO" as used herein and when immediately followed fay a 
numerical designation refer to various polypeptides, wherein the complete designation </>. "PRO/number ,f or 
more particularly. PRO200. PRO204. PR0212. PR02I6, PR0226. PRO240. PR0235. PR0245. PR0172, 
PR0273, PR0272, PR0332. PR0526. PRO701, PR0361. PR0362, PR0363. PR0364, PR0356. PR0531, 
PR0533. PROI083, PR0865, PRO770, PR0769, PR0788, PROH14. PRO1007, PROl 184, PRO1031, 
PR01346, PROI 155, PROI250, PR013I2. PROl 192, PROI246, PR012S3. PROl 195. PR01343. PR014I8, 
PRO 1387, PRO 14 10, PRO 19 17. PRO 1868, PRO205, PR021. PR0269, PR0344. PR0333. PR038L PRO720, 
PRO866. PRO840, PR0982, PR0836. PR0II59, PR01358, PROI325. PROI338. PR01434, PR04333, 
PRO4302. PRO4430 or PR05727) refers to particular polypeptide sequences as described herein. The terms 
"PRO/number polypeptide" and "PRO/number" wherein the term "number" is provided as an actual numerical 
designation {e.g.. as described above) as used herein encompass native sequence polypeptides and polypeptide 
variants (which are farther defined herein). The PRO polypeptides described herein may be isolated from a 
variety of sources, such as from human tissue types or from another source, or prepared by recombinant or 
synthetic methods. 

A "native sequence PRO polypeptide^)" comprises a polypeptide having the same amino acid 
sequence as the corresponding PRO polypeptide derived from nature. Such native sequence PRO/number 
polypeptides can be isolated from nature or can be produced by recombinant or synthetic means. "The term 
"native sequence PRO polypeptide(s)" specifically encompasses naturally -occurring truncated or secreted 
forms of the specific PRO/number polypeptide (e.g.. an extracellular domain sequence), naturally-occurring 
variant forms {e.g., alternatively spliced forms) and naturally-occurring allelic variants of the polypeptide. In 
various embodiments of the invention, the native sequence PRO polypeptides disclosed herein are mature or 
full-length native sequence polypeptides comprising the full-length amino acids sequences shown in the 
accompanying figures. Start and stop codons are shown in bold font and underlined in the figures. However, 
while the PRO/number polypeptides disclosed in the accompanying figures are shown to begin with 
methionine residues designated herein as amino acid position 1 in the figures, it is conceivable and possible 
that other methionine residues located either upstream or downstream from the amino acid position 1 in the 
figures may be employed as the starting amino acid residue for the PRO polypeptides. 

The "PRO polypeptide(s) extracellular domain'* or "ECD" refers to a form of the said polypeptide 
which is essentially free of the transmembrane and cytoplasmic domains. Ordinarily, a PRO polypeptide ECD 
will have less than 1% of such transmembrane and/or cytoplasmic domains and preferably, will have less than 
0.5% of such domains. It will be understood that any transmembrane domains identified for the PRO 
polypeptides of the present invention are identified pursuant to criteria routinely employed in the art for 
identifying that type of hydrophobic domain. The exact boundaries of a transmembrane domain may vary but 
most likely by no more than about 5 amino acids at either end of the domain as, initially identified herein. 

13 
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Optionally, therefore, an extracellular domain of a PRO polypeptide may contain from about 5 or fewer amino 
acids on either side of the transmembrane domain/extracellular domain boundary as identified in the Examples 
or specification and such polypeptides, with or without the associated signal peptide, and nucleic acid encoding 
them, are contemplated by the present invention. 

The approximate location of the "signal peptides" of the various PRO/number PRO polypeptides 
disclosed herein are shown in the present specification and/or the accompanying figures. It is noted, however, 
that the C-terminai boundary of a signal peptide may vary, but most likely by no more than about 5 amino 
acids on either side of the signal peptide C-terminal boundary as initially identified herein, wherein the C- 
terminal boundary of the signal peptide may be identified pursuant to criteria routinely employed in the an for 
identifying that type of amino acid sequence element {e.g., Nielsen ei ai, Prot: Eng. 10:1-6 (1997) and von 
Heinje et aL NucL Acids. Res. |4:4683-4690 (1986)). Moreover, it is also recognized that, in some cases, 
cleavage of a signal sequence from a secreted polypeptide is not entirely uniform, resulting in more than one 
secreted species. These mature polypeptides, where the signal peptide is cleaved within no more than about 5 
amino acids on either side of the C-terminal boundary of the signal peptide as identified herein, and the 
polynucleotides encoding them, are contemplated by the present invention. 

A "PRO polypeptide variant". "PRO/number variant" or "PRO variant" means an active PRO 
polypeptide as defined herein [e.g., below) having at least about 80% amino acid sequence identity with a full- 
length native sequence PRO polypeptide sequence as disclosed herein, a PRO polypeptide sequence lacking the 
signal peptide as disclosed herein, an extracellular domain of a PRO polypeptide, with or without the signal 
peptide, as disclosed herein or any other fragment of a full-length PRO polypeptide sequence as disclosed 
herein. Such PRO polypeptide variants include, for instance, polypeptides wherein one or more amino acid 
residues are added, or deleted, at the N- or C-termmus of the full-length native amino acid sequence. 
Ordinarily, a PRO polypeptide variant will have at least about 80% amino acid sequence identity, alternatively 
at least about 81% amino acid sequence identity, alternatively at least about 82% amino acid sequence identity, 
alternatively at least about 83% amino acid sequence identity, alternatively at least about 84% amino acid 
sequence identity, alternatively at least about 85% amino acid sequence identity, alternatively at least about 
86% amino acid sequence identity, alternatively at least about 87% amino acid sequence identity, alternatively 
at least about 88% amino acid sequence identity, alternatively at Least about 89% amino acid sequence identity, 
alternatively at least about 90% amino acid sequence identity, alternatively at least about 91% amino acid 
sequence identity, alternatively at least about 92% amino acid sequence identity, alternatively at least about 
93% amino acid sequence identity, alternatively at least about 94% amino acid sequence identity, alternatively 
at least about 95% amino acid sequence identity, alternatively at least about 96% amino acid sequence identity, 
alternatively at least about 97% amino acid sequence identity, alternatively at least about 98% amino acid 
sequence identity, alternatively at least about 99% ammo acid sequence identity with a full-length native 
sequence PRO polypeptide sequence as disclosed herein, a PRO polypeptide sequence lacking the signal 
peptide as disclosed herein, an extracellular domain of a PRO polypeptide, with or without the signal peptide, 
as disclosed herein or any other specifically defined fragment of a full-length PRO polypeptide sequence as 
disclosed herein. Ordinarily, PRO polypeptide variants are at least about 10 amino acids in length, 
alternatively at least about 20 amino acids in length, alternatively at least about 30 amino acids in length, 
alternatively at least about 40 amino acids in length, alternatively at least about 50 amino acids in length, 
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alternatively. at least about 60 amino acids in length, alternatively at least about 70 amino acids in length, 
alternatively at least about 80 amino acids in length, alternatively at least about 90 amino acids in length, 
alternatively at least about 100 amino acids in length, alternatively at least about 150 amino acids in length, 
alternatively at least about 200 amino acids in length, alternatively at least about 300 amino acids in length, 
alternatively at least about 400 amino acids on length, alternatively at least about 500 amino acids in length, 
alternatively at least about 600 amino acids in length, alternatively at least about 700 amino acids in length, 
alternatively at least about 800 amino acids in length, alternatively at least about 900 amino acids in length, 
alternatively at least about 1000 amino acids in length, alternatively at least bout 1200 amino acids in length, 
alternatively at least about 1400 amino acids in length, alternatively at least about 1500 amino acids in length 
or more. 

"Percent (%) ammo acid sequence identity" with respect to the PRO polypeptide sequences identified 
herein is defined as the percentage of amino acid residues in a candidate sequence that are identical with the 
amino acid residues in the specific PRO/number polypeptide sequence, after aligning the sequences and 
introducing gaps, if necessary, to achieve the maximum percent sequence identity, and not considering any 
conservative substitutions as pan of the sequence identity. Alignment for purposes of determining percent 
ammo acid sequence identiry can be achieved in various ways that are within the skill in the an, for instance, 
using publicly available computer software such as. BLAST, BLAST-2. ALIGN or Megalign (DNASTAR) 
software. Those skilled in the an can determine appropriate parameters for measuring alignment, including 
any algorithms needed to achieve maximal alignment over the full length of the sequences being compared. 
For purposes herein, however. % amino acid sequence identity values are generated using the sequence 
comparison computer program ALIGN-2. wherein the complete source code for the ALIGN-2 program is 
provided in Table 1 below. The ALIGN-2 sequence comparison computer program was authored by 
Genentech. Inc. and the source code shown in Tables 1 below has been filed with user documentation in the 
U.S. Copyright Office. Washington D.C., 20559. where it is registered under U.S. Copyright Registration No. 
TXU5I0087. The ALIGN-2 program is publicly available through Genentech. Inc.. South San Francisco, 
California or may be compiled from the source code provided in Table 1 below. The ALIGN-2 program 
should be compiled for use on a UNIX operating system, preferably digital UNIX V4.0D. All sequence 
comparison parameters are set by the ALIGN-2 program and do not vary. 

In situations where ALIGN-2 is employed for amino acid sequence comparisons, the % amino acid 
sequence identiry of a given amino acid sequence A to, with, or against a given amino acid sequence B (which 
can alternatively be phrased as a given amino acid sequence A that has or comprises a certain % amino acid 
sequence identity to, with, or against a given amino acid sequence B) is calculated as follows: 

100 times the fraction X/Y 

where X is the number of amino acid residues scored as identical matches by the sequence alignment program 
ALIGN-2 in that program's alignment of A and B, and where Y is the total number of amino acid residues in 
B. It will be appreciated that where the length of amino acid sequence A is not equal to the length of amino 
acid sequence B, the % amino acid sequence identity of A to B will not equal the % amino acid sequence 
identity of B to A. As examples of % amino acid sequence identiry calculations using this method, Tables 2 
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and 3 demonstrate how to calculate the % amino acid sequence identity of the amino acid sequence designated 
"Comparison Protein" to the amino acid sequence designated "PRO", wherein "PRO* represents the amino 
acid sequence of a hypothetical PRO/number polypeptide of interest, "Comparison Protein" represents the 
amino acid sequence of a polypeptide against which the "PRO" polypeptide of interest is being compared, and 
"X, "Y M and "Z" each represent different hypothetical ammo acid residues. 

Unless specifically stated otherwise, all % amino acid sequence identity values used herein are 
obtained as described in the immediately preceding paragraph using the ALIGN-2 computer program. 
However, % amino acid sequence identity values may also be obtained as described below by using the WU- 
BLAST-2 computer program (Altschul ei ai. Methods in Enzymology 266:460-480 (1996)). Most of the WU- 
BLAST-2 searc-h parameters are set to the default values. Those not set to default values, i.e.. the adjustable 
parameters, are set with the following values: overlap span = 1. overlap friction = 0.125, word threshold (T) = 
1 1, and scoring matrix = BLOSUM62. When WU-BLAST-2 is employed, a % amino acid sequence identity 
value is determined by dividing (a) the number of matching identical amino acid residues berween the amino 
acid sequence of the PRO polypeptide of interest having a sequence derived from the native sequence PRO 
polypeptide and the comparison amino acid sequence of interest (i.e., the sequence against which the PRO 
polypeptide is being compared - which may be a PRO polypeptide variant) as determined by WU-BLAST-2 by 
(b) the total number of amino acid residues of the PRO polypeptide of interest. For example, in the statement 
"a polypeptide comprising an amino acid sequence A which has or having 'at least 80% amino acid sequence 
identity to the ammo acid sequence B", the amino acid sequence A is the comparison amino acid sequence of 
interest and the amino acid sequence B is the amino acid sequence of the PRO polypeptide of interest. 

Percent amino acid sequence identity may also be determined using the sequence comparison program 
NCBI-BLAST2 (Altschul et ai. Nucleic Acids Res. 25:3389-3402 (1997)). The NCBI-BLAST2 sequence 
comparison program may be downloaded from "http^/www.ncbi.nlm.gov" or otherwise obtained from the 
National Institute of Health. Bethesda. MD. NCBI-BLAST2 uses several search parameters, wherein all of 
those search parameters are set to default values including, for example, unmask = yes, strand = all. expected 
occurrences = 10. minimum low complexity length = 15/5. multi-pass e-value = 6.01. constant for multi-pass = 
25. dropoff for final gapped alignment = 25 and scoring matrix = BLOSUM62. 

In situations where NCBI-BLAST2 is employed for amino acid sequence comparisons, the % amino 
acid sequence identity of a given amino acid sequence A to. with, or against a given amino acid sequence B 
(which can alternatively be phrased as a given amino acid sequence A that has or comprises a certain % amino 
acid sequence identity to t with, or against a given amino acid sequence B) is calculated as follows: 

100 times the fraction X/Y 

where X is the number of amino acid residues scored as identical matches by the sequence alignment program 
NCBI-BLAST2 in that program s alignment of A and B, and where Y is the total number of amino acid 
residues in B. It will be appreciated that where the length of amino acid sequence A is not equal to the length 
of amino acid sequence B, the % amino acid sequence identity of A to B will not equal the % amino acid 
sequence identity of B to A. 



16 



SUBSTITUTE SHEET (RULE 26) 



WO 00/53758 



PCTAJSOO/0584? 



"PRO variant polynucleotide" or "PRO variant nucleic acid sequence" means a nucleic acid molecule 
which encodes an active PRO polypeptide as defined below and which has at least about 80% nucleic acid 
sequence identity with a nucleotide sequence encoding: (1) a full-length native sequence PRO polypeptide as 
disclosed herein; (2) a full-length native sequence PRO polypeptide lacking the signal peptide as disclosed 
herein; (3) an extracellular, domain of a PRO polypeptide, with or without the signal peptide, as disclosed 
herein or (4) any other fragment of a full-length PRO polypeptide sequence as disclosed herein. Ordinarily, a 
PRO polypeptide variant polynucleotide will have at least about 80% nucleic acid sequence identity, 
alternatively at least about 81% nucleic acid sequence identity, alternatively at least about 82% nucleic acid 
sequence identity, alternatively at least about 83% nucleic acid sequence identity, alternatively at least about 
84% nucleic acid sequence identity, alternatively at least about 85% nucleic acid sequence identity, 
alternatively at least about 86% nucleic acid sequence identity, alternatively at least about 87% nucleic acid 
sequence identity, alternatively at least about 88% nucleic acid sequence identity, alternatively at least about 
89% nucleic acid sequence identity, alternatively at least about 90% nucleic acid sequence identity, 
alternatively at least about 91% nucleic acid sequence identiry. alternatively at least about 92% nucleic acid 
sequence identiry. alternatively at least about 93% nucleic acid sequence identiry. alternatively at least about 
94% nucleic acid sequence identity, alternatively at least about 95% nucleic acid, sequence identity, 
alternatively at least about 96% nucleic acid sequence identity, alternatively at least about 97% nucleic acid 
sequence identity, alternatively at least about 98% nucleic acid sequence identity, alternatively at least about 
99% nucleic acid sequence identiry with (I) a nucleic acid sequence encoding a full-length native sequence 
PRO polypeptide sequence as disclosed herein. (2) a full-length native sequence PRO polypeptide sequence 
lacking the signal peptide as disclosed herein, f 3 ) an extracellular domain of a PRO polypeptide sequence, with 
or without the signal sequence, as disclosed herein or (4) any other fragment of a full-length PRO polypeptide 
on sequence as disclosed herein. Variants do not encompass the native nucleotide sequence. 

Ordinanly. PRO polypeptide variant polynucleotides are at least about 30 nucleotides in length, 
alternatively at least about 60 nucleotides in length, alternatively at least about 90 nucleotides in length, 
alternatively at least about 120 nucleotides in length, alternatively at least about 150 nucleotides in length, 
alternatively at least about 180 nucleotides in length, alternatively at least about 210 nucleotides in length, 
alternatively at least about 240 nucleotides in length, alternatively at least about 270 nucleotides in length, 
alternatively at least about 300 nucleotides in length, alternatively at least about 450 nucleotides in length, 
alternatively at least about 500 nucleotides in length, alternatively at least about 600 nucleotides in length, 
alternatively at least about 700 nucleotides in length, alternatively at least about 800 nucleotides in length, 
alternatively at least about 900 nucleotides in length, alternatively at least about 1000 nucleotides in length, 
alternatively at least about 1200 nucleotides in length, alternatively at least about 1400 nucleotides in length, 
alternatively at least about 1600 nucleotides in length, alternatively at least about 1800 nucleotides in length, 
alternatively at least about 2000 nucleotides in length, alternatively at least about 2500 nucleotides in length, 
alternatively at least about 3000 nucleotides in length, alternatively at least about 3500 nucleotides in length, 
alternatively at least about 4000 nucleotides, alternatively at least about 5000 nucleotides or more. 

"Percent (%) nucleic acid sequence identity" with respect to PRO-encoding nucleic acid sequences 
identified herein is defined as the percentage of nucleotides in a candidate sequence that are identical with the 
nucleotides in the PRO nucleic acid sequence of interest, after aligning the sequences and introducing gaps, if 
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accessary, to achieve the maximum percent sequence identity. Alignment for purposes of detennining percent 
nucleic acid sequence identity can be achieved in various ways that are within the skill in the art for instance, 
using publicly available computer software such as BLAST, BLAST-2. ALIGN or Megaiign (DNASTAR) 
software. For purposes herein, however, % nucleic acid sequence identity values are generated using the 
sequence comparison computer program ALIGN-2, wherein the complete source code for the ALIGN-2 
program is provided in Table 1 below. The ALIGN-2 sequence comparison computer program was authored 
by Genentech. Inc. and the source code shown in Table I below has been filed with user documentation in the 
U.S. Copyright Office. Washington D.C.. 20559. where it is registered under U.S. Copyright Registration No. 
TXU5I0087. The ALIGN-2 program is publicly available through Genentech. Inc.. South San Francisco, 
California or may be compiled from the source code provided in Table I below. The ALIGN-2 program 
should be compiled for use on a UNIX operating system, preferably digital UNIX V4.0D. All sequence 
comparison parameters are set by the ALIGN-2 program and do not vary. 

In situations where ALIGN-2 is employed for nucleic acid sequence comparisons, the % nucleic acid 
sequence identity of a given nucleic acid sequence C to. with, or against a given nucleic acid sequence D 
(which can alternatively be phrased as a given nucleic acid sequence C that has or comprises a certain % 
nucleic acid sequence identity to. with, or against a given nucleic acid sequence D) is calculated as follows: 

100 times the fraction W/Z 

where W is the number of nucleotides scored as identical matches by the sequence alignment program ALIGN- 
2 in that program's alignment of C and D. and where Z is the total number of nucleotides in D. It will be 
appreciated that where the length of nucleic acid sequence C is not equal to the length of nucleic acid sequence 
D, the % nucleic acid sequence identity of C to D will not equal the % nucleic acid sequence identity of D to C. 
As examples of % nucleic acid sequence identity calculations. Tables 4 and 5. demonstrate how to calculate 
the % nucleic acid sequence identity of the nucleic acid sequence designated "Comparison DNA" to the 
nucleic acid sequence designated "PRO-DNA". wherein "PRO-DNA" represents a hypothetical PRO 
polypeptide - encoding nucleic acid sequence of interest, "Comparison DNA" represents the nucleotide 
sequence of a nucleic acid molecule against which the "PRO-DNA H nucleic acid molecule of interest is being 
compared, and "N w , "L" and "V" each represent different hypothetical nucleotides. 

Unless specifically stated otherwise, all % nucleic acid sequence identity values used herein are 
obtained as described in the immediately preceding paragraph using the ALIGN-2 computer program. 
However. % nucleic acid sequence identity values may also be obtained as described below by using the WU- 
BLAST-2 computer program (Altschul at aL, Methods in Enzymology 266:460-480 (1996)). Most of the WU- 
BLAST-2 search parameters are set to the default values. Those not set to default values, Le. t the adjustable 
parameters, are set with the following values: overlap span - 1, overlap fraction = 0.125. word threshold (T) = 
1 1, and scoring matrix - BLOSUM62. When WU-BLAST-2 is employed, a % nucleic acid sequence identity 
value is determined by dividing (a) the number of matching identical nucleotides between the nucleic acid 
sequence of the PRO polypeptide - encoding nucleic acid molecule of interest having a sequence derived from 
the native sequence PRO polypeptide - encoding nucleic acid (/.*., the reference sequence) and the comparison 
nucleic acid molecule of interest (i.e., the sequence against which the PRO polypeptide - encoding nucleic acid 

18 



SUBSTITUTE SHEET (RULE 26) 



WO 00/53758 



PCT/US00/05841 



molecule of interest is being compared - which may be a PRO variant polynucleotide) as determined by WTJ- 
BLAST-2 by (b) the total number of nucleotides of the PRO reference sequence. For example, in the statement 
"an isolated nucleic acid molecule comprising a nucleic acid sequence A which has or having at least 80% 
nucleic acid sequence identity to the nucleic acid sequence B'\ the nucleic acid sequence A is the comparison 
nucleic acid molecule of interest and the nucleic acid sequence B is the nucleic acid sequence of the PRO 
polypeptide - encoding nucleic acid molecule of interest. 

Percent nucleic acid sequence identity may also be determined using the sequence comparison 
program NCBI-BLAST2 (Altschui at ai t Nucleic Acids Res. 25:3389-3402 (1997)). The NCBI-BLAST2 
sequence comparison program may be downloaded from " http://www.ncbi.nim.nih. gov" or otherwise obtained 
from the National Institute of Heath. Bethesda. MD. NCBI-BLAST2 uses several search parameters, wherein 
ail of those search parameters are set to default values including, for example, unmask =■ yes, strand = ail. 
expected occurrences = 10. minimum low complexity length = 15/5. multi-pass e-value = 0.01. constant for 
muiti-pass = 25. dropoff for final gapped alignment = 25 and scoring matrix = BLOSUM62. 

In situations where NCBI-BLAST2 is employed for sequence comparisons, the % nucleic acid 
sequence identity of a given nucleic acid sequence C to. with, or against a given nucleic add sequence D 
(which can alternatively be phrased as a given nucleic acid sequence C that has or comprises a certain % 
nucleic acid sequence identity to. with, or against a given nucleic acid sequence D) is calculated as follows: 

100 times the fraction W/Z 

where W is the number of nucleotides scored as identical matches by the sequence alignment program NCBI- 
BLAST2 in that program's alignment of C and D. and where Z is the total number of nucleotides in D. It will 
be appreciated that where the length of nucleic acid sequence C is not equal to the length of nucleic acid 
sequence D. the % nucleic acid sequence identity of C to D will not equal the % nucleic acid sequence identity 
ofDtoC. 

In other embodiments. PRO variant polynucleotides are nucleic acid molecules that encode an active 
PRO polypeptide and which arc capable of hybridizing, preferably under stringent hybridization and wash 
conditions, to nucleotide sequences encoding a full-length PRO polypeptides as disclosed herein. PRO variant 
polypeptides may be those that are encoded by a PRO variant polynucleotide. 

The term "positives", in the context of sequence comparison performed as described above, includes 
residues in the sequences compared that are not identical but have similar properties (e.*.. as a result of 
conservative substitutions, see Table 6 below). For purposes herein, the % value of positives is determined by 
dividing (a) the number of amino acid residues scoring a positive value between the PRO polypeptide sequence 
of interest having a sequence derived from a native sequence PRO polypeptide and the comparison amino acid 
sequence of interest (i.e.. the amino acid sequence against which the PRO polypeptide sequence is being 
compared) as determined in the BLOSUM62 matnx of WU-BLAST-2 by (b) the total number of amino acid 
residues of the PRO polypeptide of interest. 

Unless specifically stated otherwise, the % value of positives is calculated as described in the 
immediately preceding paragraph. However, in the context of the amino acid sequence identity comparisons 
performed as described for ALIGN- 2 and NCBI-BLAST-2 above, includes amino acid residues in the 
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sequences compared that are not only identical, but also those that have similar properties. Amino acid 
residues that score a positive value to an amino acid residue of interest are those that are either identical to the 
amino acid residue of interest or are a preferred substitution (as defined in Table I beJow) of the amino acid 
residue of interest. 

For amino acid sequence comparisons using ALIGN- 2 or NCBI-BLAST2. the % value of positives of 
a given amino acid sequence A to. with, or against a given amino acid sequence B (which can altemativeiy be 
phrased as a given amino acid sequence A that has or comprises a certain % positives to. with, or against a 
given amino acid sequence B) is calculated as follows: 

100 times the fraction X/Y 

where X is the number of amino acid residues scoring a positive value as defined above by the sequence 
alignment program ALIGN- 2 or NCBI-BLAST2 in that program's alignment of A and B, and where Y is the 
total number of ammo acid residues in B. It will be appreciated that where the length of amino acid sequence 
A is not equal to the length of amino acid sequence B. the % positives of A to B will not equal the % positives 
ofBtoA. 

"Isolated." when used to describe the various polypeptides disclosed herein, means polypeptide that 
has been identified and separated and/or recovered from a component of its natural environment. Contaminant 
components of its natural environment are materials that would typically interfere with diagnostic or 
therapeutic uses for the polypeptide, and may include enzymes, hormones, and other proteinaceous or non- 
proteinaceous solutes. In preferred embodiments, the polypeptide will be purified (i) to a degree sufficient to 
obtain at least 15 residues of N-terminal or internal amino acid sequence by use of a spinning cup sequenator. 
or (2) to homogeneity by SDS-PAGE under non-reducing or reducing conditions using Coomassie blue or. 
preferably, silver stain. Isolated polypeptide includes polypeptide in situ within recombinant ceils, since at 
least one component of the PRO polypeptide in its natural environment will not be present. Ordinarily, 
however, isolated polypeptide will be prepared by at least one purification step. 

An "isolated" PRO polypeptide - encoding nucleic acid or other polypeptide-encoding nucleic acid is 
a nucleic acid molecule that is identified and separated from at least one contaminant nucleic acid molecule 
with which it is ordinarily associated in the natural source of the polypeptide-encoding nucleic acid. An 
isolated polypeptide-encoding nucleic acid molecule is other than in the context or setting in which it is found 
in nature. Isolated polypeptide - encoding nucleic acids therefore are distinguished from the polypeptide - 
encoding nucleic acid molecule existing in natural ceils. However, an isolated PRO polypeptide - encoding 
nucleic acid molecule includes the same contained in cells that ordinarily express the specific polypeptide 
where, for example, the nucleic acid molecule is in a chromosomal location different from that of natural cells. 

The term "control sequences" refers to DNA sequences necessary for the expression of an operably 
linked coding sequence in a particular host organism. The control sequences that are suitable for prokaryotes, 
for example, include a promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic 
cells are known to utilize promoters, polyadenyiation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic 
acid sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA for a 
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polypeptide if it is expressed as a preprotem that participates in the secretion of the polypeptide: a promoter or 
enhancer is operably linked to a coding sequence if it affects the transcription of the sequence: or a ribosome 
binding site is operably linked to a coding sequence if it is positioned so as to facilitate translation. Generally, 
"operably linked" means that the DNA sequences being linked are contiguous, and. in the case of a secretory 
leader, contiguous and in reading phase. However, enhancers do not have to be contiguous. Linking is 
accomplished by ligation at convenient restriction sites. If such sites do not exist, the synthetic oligonucleotide 
adaptors or linkers are used in accordance with conventional practice. 

"Stringency" of hybridization reactions is readily determinable by one of ordinary skill in the an, and 
generally is an empirical calculation dependent upon probe length, washing temperature, and salt 
concentration. In general, longer probes require higher temperatures for proper annealing, while shorter probes 
need lower temperatures. Hybridization generally depends on the ability of denatured DNA to reanneal when 
complementary strands are present in an environment below their melting temperature. The higher the degree 
of desired homology between the probe and hybridizable sequence, the higher the relative temperature which 
can be used: As a result, it follows that higher relative temperatures would tend to make the reaction 
conditions more stringent, while lower temperatures less so. For additional details and explanation of 
stringency of hybridization reactions, see Ausubei ei at.. Current Protocols in Molecular Biology. Wiley 
Interscience Publishers. ( 1 995). 

"Stringent conditions" or "high stringency conditions." as defined herein, may be identified by those 
that: (1) employ low ionic strength and high temperature for washing, for example 0.015 M sodium 
chioride-0.0015 M sodium citraie/0.1% sodium dodecyl sulfate at 50 G C; (2) employ during hybridization a 
denaturing agent, such as formamide, for example, 50% (v/v) formamide with 0.1% bovine serum 
albumin/0.1% Ficoll/0.1% polyvmylpyrroiidone/50 mM sodium phosphate buffer at pH 6.5 with 750 mM 
sodium chloride. 75 mM sodium citrate at 42°C; or (3) employ 50% formamide, 5 x SSC (0.75 M NaCl. 0.075 
M sodium citrate). 50 mM sodium phosphate (pH 6.8). 0.1% sodium pyrophosphate. 5 x Dcnhardfs solution, 
sonicated salmon sperm DNA (50 ug/mi). 0. 1% SDS. and 10% dextran sulfate at 42°C. with washes at 42°C in 
0.2 x SSC (sodium chloride/sodium citrate i and 50% formamide at 55°C. followed by a high-stringency wash 
consisting of 0. 1 x SSC containing EDTA at 55°C. 

"Moderately stringent conditions" may be identified as described by Sambrook et aL Molecular 
Cloning: A Laboratory Manual. New York: Cold Spring Harbor Press. 1989. and include the use of washing 
solution and hybridization conditions (e.g., temperature, ionic strength and %SDS) less stringent that those 
described above. An example of moderately stringent conditions is overnight incubation at 37°C in a solution 
comprising: 20% formamide. 5 x SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate 
(pH 7.6), 5 x Denhardrs solution. 10% dextran sulfate, and 20 ug/mL denatured sheared salmon sperm DNA, 
followed by washing the filters in 1 x SSC at about 37-50°C. The skilled artisan will recognize how to adjust 
the temperature, ionic strength, etc., as necessary to accommodate factors such as probe length and the like. 

"Antibodies" (Abs) and "immunoglobulins" (Igs) are glycoproteins having the same general structural 
characteristics. While antibodies exhibit binding specificity to a specific antigen, immunogiobuiins include 
both antibodies and other antibody-like molecules which lack antigen specificity. Polypeptides of the latter 
kind arc, for example, produced at low levels by the lymph system and at increased levels by myelomas. The 
term "antibody" is used in the broadest sense and specifically covers, without limitation, intact monoclonal 
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antibodies (including agonist, antagonist and neutralizing antibodies), polyclonal antibodies, multispecific 
antibodies (e.g.. bispecific antibodies) formed from at least two intact antibodies, single chain antibodies 
binding the epitopes specific to the PRO polypeptide and antibody fragments so long as they exhibit the desired 
biological activity. An anti-PRO200. anti-PRO204. anti-PR02I2, anti-PR0216, anti-PR0226, anti-PRO240, 
anti-PR0235, am>PR0245, anti-PR0172. ami-PR0273, anti-PR0272, anti-PR0332. anti-PR0526, anti- 
PRO701, anti-PR0361, anti-PR0362, anti-PR0363. anti-PR0364. anti-PR0356. anti-PR053l. anti-PR0533, 
anti-PROI083. anti-PR0865. anti-PRO770. anti-PR0769, anti-PR0788. anti-PROl 1 14, anti-PRO1007. ami- 
PR01134. anti-PROI03l, anti-PROI346. ami-PROl!55 f anti-PROt250, anti-PRO!312, anti-PROH92. ana- 
PR01246, anti-PROI283. anti-PROl 195. anti-PRO!343, ami-PROI4I8. anti-PR01387. anti-PR014lO. anti- 
PROl 9 17. anti-PROl 868, anu-PRO205 r ami-PR021. anti-PR0269, anti-PR0344. anti-PR0333, anti-PR0381, 
anti-PRO720, anti-PR0866, anti-PRO840. anti-PR0982. anti-PR0836, anti-PR0U59. ami-PROI358. ami- 
PROI325. anti-PRO!338, anti-PRO!434. anti-PR04333, anti-PRO4302, anti-PRO4430 or anti-PR05727 
antibody is an antibody which immunologically binds to a PRO200. PRO204. PR0212, PR0216. PR0226, 
PRO240. PR0235. PR0245. PROI72, PR0273. PR0272. PR0332, PR0526, PRO70I, PR0361. PR0362. 
PR0363. PR0364. PR0356. PR053 I..PR0533. PROI083. PR0865. PRO770. PR0769. PR0788. PRO! 1 14. 
PRO 1 007, PRO 11 84. PRO 103 I, PRO 1346. PRO II 55. PRO 1250. PRO 13 12. PRO 1192. PRO 1246. PRO 1283. 
PRO! 195. PR01343. PR01418. PR01387. PRO1410. PROI917, PR01868. PRO205. PR021. PR0269, 
PR0344. PR0333, PR0381. PRO720. PR0866. PRO840. PR0982. PR0836. PROi 159. PR01358. PR01325. 
PR01338. PR01434, PR04333, PRO4302. PRO4430 or PR05727, respectively, polypeptide. The antibody 
may bind to any domain of the PRO polypeptide which may be contacted by the antibody. For example, the 
antibody may bind to any extracellular domain of the polypeptide and when the entire polypeptide is secreted, 
to any domain on the polypeptide which is available to the antibody for binding. 

"Native antibodies" and "native immunoglobulins" are usually heterotetrameric glycoproteins of about 
150,000 daltons. composed of two identical light (L) chains and two identical heavy (H) chains. Each light 
chain is linked to a heavy chain by one covaiem disulfide bond, while the number of disulfide linkages vanes 
among the heavy chains of different immunoglobulin isorypes. Each heavy and light chain also has regularly 
spaced intrachain disulfide badges. Each heavy chain has at one end a variable domain (V M ) followed by a 
number of constant domains. Each light chain has a variable domain at one end (V L ) and a constant domain at 
its other end: the constant domain of the light chain is aligned with the first constant domain of the heavy 
chain, and the light-chain variable domain is aligned with the variable domain of the heavy chain. Particular 
amino acid residues are believed to form an interface between the light- and heavy-chain variable domains. 

The term "variable" refers to the fact that certain portions of the variable domains differ extensively in 
sequence among antibodies and are used in the binding and specificity of each particular antibody for its 
particular antigen. However, the variability is not evenly distributed throughout the variable domains of 
antibodies. It is concentrated in three or four segments called "complementarity-determining regions" (CDRs) 
or "hypervariable regions" in both in the light-chain and the heavy-chain variable domains. The more highly 
conserved portions of variable domains are called the framework (FR). The variable domains of native heavy 
and light chains each comprise four or five FR regions, largely adopting a p-sheet configuration, connected by 
the€DRs, which form loops connecting, and in some cases forming part of, the (5-sheet structure. The CDRs 
in each chain are held together in close proximity by the FR regions and with the CDRs from the other chain, 
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contribute to the formation of the antigen-binding site of antibodies <see Kabat et aL NIH Publ. No.91-3242, 
Vol. I, pages 647-669 (1991)). There are at least two techniques for determining the extent of the CDRs: (1) 
An approach based on the extent of cross-species sequence variability (i.e.. Kabat et aL Sequences of Proteins 
of Immunological Interest (National Institute of Health. Bethesda. MD); and (2) an approach based on 
crystallographic studies of antigen-antibody complexes (Chothia. C. et aL, (1989), Nature 342: 877). 
Moreover. CDR's can also be defined using a hybrid approach incorporating the rescues identified by both of 
the previous techniques. The constant domains arc not uivolved directly in binding an antibody to an antigen, 
but exhibit various effector functions, such as participation of the antibody in antibody-dependent cellular 
toxicity. 

"Antibody fragments" comprise a portion of an intact antibody, preferably the antigen binding or 
variable region of the intact antibody. Examples of antibody fragments include Fab. Fab'. F(ab') 2 , and Fv 
fragments: diabodies: linear antibodies (Zapata ct aL. Protein Eng. 8 (10): 1057-1062 [1995]); single-chain 
antibody molecules: and multispecific antibodies formed from antibody fragments. 

Papain digestion of antibodies produces rwo identical antigen- binding fragments, called "Fab" 
fragments, each with a single antigen-binding site, and a residual "Fc" fragment, whose name reflects its ability 
to crystallize readily. Pepsin treatment yields an Rab'b fragment that has two antigen-combining sites and is 
still capable of cross-linking antigen. 

. "Fv" is the minimum antibody fragment which contains a complete antigen-recognition and binding 
site. This region consists of a dimer of one heavy- and one light-chain variable domain in tight non-covalent 
association. It is in this configuration that the three CDRs of each variable domain interact to define an 
antigen-binding sue on the surface of the V H -V L dimer. Collectively, the six CDRs confer antigen-binding 
specificity to the antibody. However, even a single variable domain (or half of an Fv comprising only three 
CDRs specific for an antigen) has the ability to recognize and bind antigen, although at a lower affinity than 
the entire binding site. 

The Fab fragment also contains the constant domain of the light chain and the first constant domain 
(CHI) of the heavy chain. Fab 1 fragments differ from Fab fragments by the addition of a few residues at the 
carboxy terminus of the heavy chain CHI domain including one or more cysteines from the antibody hinge 
region. Fab'-SH is the designation herein for Fab' in which the cysteine residue(s) of the constant domains bear 
a free thiol group. F(ab") 2 antibody fragments originally were produced as pairs of Fab' fragments which have 
hinge cysteines between them. Other chemical couplings of antibody fragments are also known. 

The "light chains" of antibodies (immunoglobulins) from any vertebrate species can be assigned to 
one of two clearly distinct types, called kappa (k) and lambda (X). based on the amino acid sequences of their 
constant domains. 

Depending on the amino acid sequence of the constant domain of their heavy chains, 
immunoglobulins can be assigned to different classes. There are five major classes of immunoglobulins: IgA, 
IgD, IgE IgG, and IgM, and several of these may be further divided into subclasses (isorypes), e.g.. IgGl, 
IgG2, IgG3, IgG4, IgA, and IgA2. The heavy-chain constant domains that correspond to the different classes 
of immunoglobulins are called a, 5, e, y, and u, respectively. The subunit structures and three-dimensional 
configurations of different classes of immunoglobulins are well known. 
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The term "monoclonal antibody" as used herein refers to an antibody obtained from a population of 
substantially homogeneous antibodies, i.e.. the individual antibodies comprising the population are ideniicaJ 
except for possible naturally occurring mutations that may be present in minor amounts. Monoclonal 
antibodies are highly specific, being directed against a single antigenic site. Furthermore, in contrast to 
conventional (polyclonal) antibody preparations which typically include different antibodies directed against 
different determinants (epitopes), each monocJonal antibody is directed against a single determinant on the 
antigen. In addition to their specificity, the- monoclonal antibodies are advantageous in that they are 
synthesized by the hybridoma culture, unconiaminated by other immunoglobulins. The modifier "monoclonal" 
indicates the character of the antibody as being obtained from a substantially homogeneous population of 
antibodies, and is not to be construed as requiring production of the antibody by any particular method. For 
example, the monoclonal antibodies to be used in accordance with the present invention may be made by the 
hybridoma method first described by Kohler et ai. Nature, 256: 495 [1975], or may be made by recombinant 
DNA methods (see, e.g., U.S. Patent No. 4.816,567). Hie "monoclonal antibodies" may also be isolated from 
phage antibody libraries using the techniques described in Clackson a ai. Nature. 352:624-628 [1991) and 
Marks et ai. J. Mol. Biol.. 222:581-597 (1991). for example. See also Lf.S Patent Nos. 5.750,373. 5,571.698, 
5.403.484 and 5.223.409 which describe the preparation of antibodies using phaeemid and phage vectors. 

The monoclonal antibodies herein specifically include "chimeric" antibodies (immunoglobulins) in 
which a portion of the heavy and/or light chain is identical with or homologous to corresponding sequences in 
antibodies derived from a particular species or belonging to a particular antibody class or subclass, while the 
remainder of the chain's) is identical with or homologous to corresponding sequences in antibodies derived 
from another species or belonging to another antibody class or subclass, as well as fragments of such 
antibodies, so long as they exhibit the desired biological activity (U.S. Patent No. 4.816,567: Morrison et ai, 
Proc. Natl. Acad. Sci. USA. BV.6S5 1 -6855 [ 1 984)). 

"Humanized" forms of non-human (e.g.. murine) antibodies arc chimeric immunoglobulins, 
immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab 1 . F(ab*) 2 or other antigen-binding 
subsequences of antibodies* which contain minimal sequence derived from non-human immunoglobulin. For 
the most pan. humanized antibodies arc human immunoglobulins (recipient antibody) in which residues from a 
complementarity-determining region (CDR) of the recipient are replaced by residues from a CDR of a non- 
human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity, and 
capacity. In some instances. Fv framework region (FR) residues of the human immunoglobulin are replaced by 
corresponding non-human residues. Furthermore, humanized antibodies may comprise residues which are 
found neither in the recipient antibody nor in the imported CDR or framework sequences. These modifications 
are made to further refine and maximize antibody performance. In general, the humanized antibody will 
comprise substantially all of at least one, and typically two, variable domains, in which all or substantially all 
of the CDR regions correspond to those of a non-human immunoglobulin and all or substantially all of the FR 
regions are those of a human immunoglobulin sequence. The humanized antibody optimally also will 
comprise at least a portion of an immunoglobulin constant region (Fc), typically that of a human 
immunoglobulin. For further details, see Jones et ai t Nature, 321:522-525 (1986); Reichmann et aL. Nature, 
332:323-329 [1988]; and Presta, Curr. Op. Struct. Bioi, 2:593-596 (1992). The humanized antibody includes a 
"primatize^antibody where the antigen-binding region of the antibody is derived from an antibody produced 
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by immunizing macaque monkeys with the antigen of interest. Antibodies containing residues from Old World 
monkeys are also possible within the invention. See, for example, U.S. Patent Nos. 5,658,570; 5,693,780; 
5,681,722; 5,750,105; and 5,756,096. 

Antibodies and fragments thereof in this invention also include "affinity matured" antibodies in which 
an antibody is altered to change the amino acid sequence of one or more of the CDR regions and/or the 
framework regions to alter the affinity of the antibody or fragment thereof for the antigen to which it binds. 
Affinity maturation may result in an increase or in a decrease in the affinity of the matured antibody for the 
antigen relative to the starting antibody. Typically, the starting antibody will be a humanized, human, chimeric 
or murine antibody and the affinity matured antibody wiil have a higher affinity than the starting antibody. 
During the maturation process, one or more of the amino acid residues in the CDRs or in the framework 
regions are changed to a different residue using any standard method. Suitable methods include point 
mutations using well known cassette mutagenesis methods (Wells et at., 1985. Gene 34:315) or oligonucleotide 
mediated mutagenesis methods (ZoIIer et a/., 1987, Nucleic Acids Res. ]0: 6487-6504). Affinity maturation 
may also be performed using known selection methods in which many mutations are produced and mutants 
having the desired affinity are selected from a pool or library of mutants based on improved affinity for the 
antigen or ligand. Known phage display techniques can be conveniently used in this approach. See. for 
example, U.S. 5,750,373; U.S. 5.223,409, etc. 

Human antibodies are also with in the scope of the antibodies of the invention. Human antibodies can 
be produced using various techniques known in the an, including phage display libraries [Hoogenboom and 
Winter. Mot. Biol., 227:381 (1991); Marks et ai,J. Mot. Biol., 222:581 (1991)]. The techniques of Cole et 
al. and Boerner et ai. are also available for the preparation of human monoclonal antibodies (Cole et aL 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss. p. 77 (1985); Boerner et aL J. Immunol.' Ul 
£0:86-95 (1991); U. S. 5,750, 373]. Similarly, human antibodies can be made by introducing of human 
immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous immunoglobulin genes have 
been partially or completely inactivated. Upon challenge, human antibody production is observed, which 
closely resembles that seen in humans in all respects, including gene rearrangement, assembly, and antibody 
repertoire. This approach is described, for example, in U.S. Patent Nos. 5,545,807; 5.545.806; 5,569,825; 
5,625,126; 5,633,425; 5,661,016. and in the following scientific publications: Marks etai. Bio/Technology ]0. 
779-783 (1992); Lonberg et aL Mature 368 856-859 (1994); Morrison, Nature 368, 812-13 (1994); Fishwild 
et aL Nature Biotechnology \±. 845-51 (1996); Neuberger, Nature Biotechnology \4. 826 (1996): Lonberg and 
Huszar. intern. Rev. Immunol \3 65-93 (1995). 

"Single-chain Fv" or "sFv" antibody fragments comprise the V M and V L domains of antibody, wherein 
these domains are present in a single polypeptide chain. Preferably, the Fv polypeptide further comprises a 
polypeptide linker between the V H and domains which enables the sFv to form the desired structure for 
antigen binding. For a review of sFv see Pluckthun in The Pharmacology of Monoclonal Antibodies, vol. 1 13, 
Rosenburg and Moore eds., Springer- Verlag, New York, pp. 269-315 (1994). 

The term "diabodies" refers to small antibody fragments with two antigen-binding sites, which 
fragments comprise a heavy-chain variable domain (V H ) connected to a light-chain variable domain (VjJ in 
the same polypeptide chain (V H - V L ). By using a linker that is too short to allow pairing between the two 
domains on the same chain, the domains are forced to pair with the complementary domains of another chain 
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and create two antigen-binding sites. Diabodies are described more fully in. for example, EP 404,097: WO 
93/1 1161: and Hollinger et aL. Proc. Natl. Acad. ScL USA, 90:6444-6448 (1993). 

The word "label" when used herein refers to a detectable compound or composition which is 
conjugated directly or indirectly to the compound, e.g., antibody or polypeptide, so as to generate a "labelled" 
5 compound. The label may be detectable by itself (e.g., radioisotope labels or fluorescent labels) or. in the case 
of an enzymatic label, may catalyze chemical alteration of a substrate compound or composition which is 
detectable. 

By "solid phase" is meant a non-aqueous matrix to which the compound of the present invention can 
adhere. Examples of solid phases encompassed herein include those formed partially or entirely of glass (e.g., 

10 controlled pore glass), polysaccharides (e.g., agarose), polyacryiamides, polystyrene, polyvinyl alcohol and 
silicones. In certain embodiments, depending on the context, the solid phase can comprise the well of an assay 
plate; in others it is a purification column {e.g., an affinity chromatography column). This term also includes a 
discontinuous solid phase of discrete panicles, such as those described in U.S. Patent No. 4.275,149. 

The term "immune related disease" means a disease in which a component of the immune system of a 

1 5 mammal causes, mediates or otherwise contributes to a morbidity in the mammal. Also included are diseases 
in which stimulation or intervention of the immune response has an ameliorative effect on progression of the 
disease. Included within this term are immune-mediated inflammatory diseases, non-immune-mediated 
inflammatory diseases, infectious diseases, immunodeficiency diseases, neoplasia, etc. 

The term "T ceil mediated" disease means a disease in which T cells directly or indirectly mediate or 

20 otherwise contribute to a morbidity in a mammal. The T cell mediated disease may be associated with ceil 
mediated effects, lymphokine mediated effects, etc., and even effects associated with B cells if the B cells are 
stimulated, for example, by the lymphokines secreted by T cells. 

Examples of immune-related and inflammatory diseases, some of which are immune or T eel! 
mediated, which can be treated according to the invention include systemic lupus erythematosis. rheumatoid 

25 arthritis, juvenile chronic arthritis, spondyloarthropathies, systemic sclerosis (scleroderma), idiopathic 
inflammatory myopathies idcrmatomyositis. polymyositis). Sjogren's syndrome, systemic vasculitis, 
sarcoidosis, autoimmune hemolytic anemia (immune pancytopenia, paroxysmal nocturnal hemoglobinuria), 
autoimmune thrombocytopenia (idiopathic thrombocytopenic purpura, immune- mediated thrombocytopenia), 
thyroiditis (Grave's disease, Hashimoto's thyroiditis, juvenile lymphocytic thyroiditis, atrophic thyroiditis), 

30 diabetes mellitus. immune-mediated renal disease (glomerulonephritis, tubulointerstitial nephritis), 
demyeiinating diseases of the central and peripheral nervous systems such as multiple sclerosis, idiopathic 
demyelinating polyneuropathy or Guillain-Barre syndrome, and chronic inflammatory demyeiinating 
polyneuropathy, hepatobiliary diseases such as infectious hepatitis (hepatitis A, B, C ( D, E and other non- 
hepatotropic viruses), autoimmune chronic active hepatitis, primary biliary cirrhosis, granulomatous hepatitis. 

35 and sclerosing cholangitis, inflammatory bowel disease (ulcerative colitis: Crohn's disease), gluten-sensitive 
enteropathy, and Whipple's disease, autoimmune or immune-mediated skin diseases including bullous skin 
diseases, erythema multiforme and contact dermatitis, psoriasis, allergic diseases such as asthma, allergic 
- rhinitis, atopic dermatitis, food hypersensitivity and urticaria, immunologic diseases of the lung such as 
eosinophilic pneumonias, idiopathic pulmonary fibrosis and hypersensitivity pneumonitis, transplantation 

40 associated diseases including graft rejection and graft -versus-host-disease. Infectious diseases including viral 
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diseases such as AIDS (HIV infection), hepatitis A. B, C, D. and E. herpes, etc.. bacterial infections, fungal 
infections, protozoal infections and parasitic infections. 

"Treatment" is an intervention performed with the intention of preventing the development or altering 
the pathology of a disorder. Accordingly, "treatment" refers to both therapeutic treatment and prophylactic or 
5 preventative measures. Those in need of treatment include those already with the disorder as well as those in 
which the disorder is to be prevented. In treatment of an immune related disease, a therapeutic agent may 
directly decrease or increase the magnitude of response of a component of the immune response, or render the 
disease more susceptible to treatment by other therapeutic agents, e.g.. antibiotics, antifungals, anti- 
inflammatory agents, chemotherapeutics. etc. 

10 The term "effective amount" is at least the minimum concentration or amount of a PRO polypeptide 

and/or agonist/antagonist which causes, induces or results in either a detectable improvement in a component 
of the immune response in mammals as measured in an in vitro assay. For example, an increase or decrease in 
the proliferation of T-cclls and/or vascular permeability as measured in Examples provided herein. 
Furthermore, a "therapeutically effective amount" is the minimum concentration or amount of a PRO 

15 polypeptide and/or agonist/antagonist which would be effective in at least attenuating a pathology (increasing 
or decreasing as the case may be) a component of the immune response in mammals, the results of which 
effects a treatment as defined in the previous paragraph. 

"Chronic" administration refers to administration of the agent(s) in a continuous mode as opposed to 
an acute mode, so as to maintain the initial therapeutic effect (activity) for an extended period of time. 

20 "intcrmiiienfc" administration is treatment that is not consecutively done without interruption, but rather is 
cyclic in nature. 

The "pathology" of an immune related disease includes all phenomena that compromise the well- 
being of the patient. This includes, without limitation, abnormal or uncontrollable cell growth, antibody 
production, auto-antibody production, complement production and activation, interference with the normal 
25 functioning of neighboring cells, release of cytokines or other secretory products at abnormal levels, 
suppression or aggravation of any inflammatory or immunological response, infiltration of inflammatory cells 
(neutrophilic, eosinophilic, monocytic, lymphocytic) into tissue spaces, etc. 

"Mammal" for purposes of treatment refers to any animal classified as a mammal, including humans, 
domestic and farm animals, and zoo. sports, or pet animals, such as dogs, horses, cattle, pigs, apes, hamsters, 
30 ferrets, cats, etc. Preferably, the mammal is human. 

Administration "in combination with" one or more further therapeutic agents includes simultaneous 
(concurrent) and consecutive administration in any order. 

"Carriers" as used herein include pharmaceutically acceptable carriers, excipients, or stabilizers which 
are nontoxic to the cell or mammal being exposed thereto at the dosages and concentrations employed. Often 
35 the physiologically acceptable carrier is an aqueous pH buffered solution. Examples of physiologically 
acceptable carriers include buffers such as phosphate, citrate, and other organic acids; antioxidants including 
ascorbic acid; low molecular weight (less than about 10 residues) polypeptide; proteins, such as serum 
albumin, gelatin, or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as 
glycine, glutaraine, asparagine, arginine or lysine; monosaccharides, disaccharides, and other carbohydrates 
40 including glucose, mannose. or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or 
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sorbitol: salt- forming countcrions such as sodium; and/or nonionic surfactants such as TWEEN™, 
polyethylene glycol (PEG), and PLURONICS™. 

The term "cytotoxic agent" as used herein refers to a substance that inhibits or prevents the function of 
cells and/or causes destruction of cells. The term is intended to include radioactive isotopes {e.g., I m . I 125 , Y 90 
5 and Re 186 ), chemotherapeutic agents, and toxins such as enzymatically active toxins of bacterial, fungal, plant 
or animal origin, or fragments thereof. 

A "chemotherapeutic agent" is a chemical compound useful in the treatment of cancer. Examples of 
chemotherapeutic agents include adriamycin, doxorubicin, epirubicin. 5-fluorouracil, cytosine arabinoside 
("Ara-C"), cyclophosphamide, thiotepa, busulfan. cytoxin. taxoids. e.g.. paclitaxel (Taxol. Bristol-Myers 

10 Squibb Oncology, Princeton. NJ). and doxetaxel (Taxotere, Rhone-Poulenc Rorer, Antony. France), toxotere, 
methotrexate, cisplatin, melphalan. vinblastine, bleomycin, etoposide. ifosfamide. mitomycin C. mitoxantrone. 
vincristine, vinorelbine. carboplatin, teniposide. daunomycin. carminomycin. aminopterin. dactinomycin. 
mitomycins, esperamicins (sec U.S. Pat. No. 4,675,187), melphalan and other related nitrogen mustards. Also 
included in this definition are hormonal agents that act to regulate or inhibit hormone action on tumors such as 

15 tamoxifen and onapristone. 

A "growth inhibitory agent" when used herein refers to a compound or composition which inhibits 
growth of a cell, especially cancer cell overexpressing any of the genes identified herein, either in vitro or in 
vivo. Thus, the growth inhibitory agent is one which significantly reduces the percentage of cells 
overexpressing such genes in S phase. Examples of growth inhibitory agents include agents that block cell 

20 cycle progression (at a place other than S phase), such as agents that induce Gi arrest and M-phase arrest. 
Classical M-phase blockers include the vincas (vincristine and vinblastine), taxol, and topo II inhibitors such as 
doxorubicin, epirubicin. daunorubicin. etoposide. and bleomycin. Those agents that arrest G 1 also spill over 
into S-phase arrest, for example, DNA alkylating agents such as tamoxifen, prednisone, dacarbazine. 
mechlorethamine. cisplatin. methotrexate. 5-fluorouracil, and ara-C. Further information can be found in Vie 

25 Molecular Basis of Cancer. Mendelsohn and Israel, eds.. Chapter 1. entitled "Cell cycle regulation, oncogens, 
and antineoplastic drugs" by Murakami ct al. (WB Saunders: Philadelphia. 1995). especially p. 13. 

The term "cytokine" is a generic term for proteins released by one cell population which act on 
another ceil as intercellular mediators. Examples of such cytokines are lymphokines. monokines, and 
traditional polypeptide hormones. Included among the cytokines are growth hormone such as human growth 

30 hormone, N-methionyl human growth hormone, and bovine growth hormone: parathyroid hormone; thyroxine; 
insulin; proinsulin; relaxin; prorelaxin; glycoprotein hormones such as follicle stimulating hormone (FSH), 
thyroid stimulating hormone (TSH), and luteinizing hormone (LH); hepatic growth factor, fibroblast growth 
factor, prolactin; placental lactogen; tumor necrosis factor-a and -P; mullerian- inhibiting substance: mouse 
gonadotropin-associated peptide: inhibin; activtn; vascular endothelial growth factor, integrin; thrombopoietin 

35 (TPO); nerve growth factors such as NGF-0; platelet-growth factor, transforming growth factors (TGFs) such 
as TGF-a and TGF- (3; insulin-like growth factor-I and -II; erythropoietin (EPO); osteoinductive factors; 
interferons such as interferon- a, -0, and -y; colony stimulating factors (CSFs) such as macrophage-CSF (M- 
CSF); granulocyte-macrophage-CSF (GM-CSF); and granulocyte-CSF (G-CSF); interleukins (ILs) such as IL- 
t, IL-la, IL-2, IL-3, IL^, IL-5, IL-6, IL-7. IL-8, IL-9, IL-l 1, 1L-12; a tumor necrosis factor such as TOF-a or 

40 TNF-P; and other polypeptide factors including LIF and kit ligand (KL). As used herein, the term cytokine 
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includes proteins from natural sources or from recombinant cell culture and biologically active equivalents of 
the native sequence cytokines. 

The term "epitope tagged" when used herein refers to a chimeric polypeptide comprising a PRO 
polypeptide fused to a "tag polypeptide". The tag polypeptide has enough residues to provide an epitope 
against which an antibody can be made, yet is short enough such that it does not interfere with activity of the 
polypeptide to which it is fused. The tag polypeptide preferably also is fairly unique so that the antibody does 
not substantially cross-react with other epitopes. Suitable tag polypeptides generally have at least six amino 
acid residues and usually between about 8 and 50 amino acid residues (preferably, between about 10 and 20 
amino acid residues). 

"Active'-' or "activity" in the context of variants of the PRO polypeptide refers to form(s) of proteins of 
the invention which retain the biologic and/or the ability to induce the production of an antibody against an 
antigenic epitope possessed by the PRO polypeptide. More specifically, "biological activity" refers to a 
biological function (either inhibitory or stimulatory) caused by a native sequence or naturally-occurring PRO 
polypeptide. Even more specifically, "biological activity" in the context of an antibody or another molecule 
that can be identified by the screenuig assays disclosed herein {e.g.. an organic or inorganic small molecule, 
peptide, c/c) can be the ability of such molecules to induce or inhibit infiltration of inflammatory ceils into a 
tissue, to stimulate or inhibit T-cell proliferation or activation, to stimulate or inhibit cytokine release by ceils 
or to increase or decrease vascular permeability. Another specific biological activity is the increased vascular 
permeability or the inhibition thereof. 

The term "antagonist" is used in the broadest sense, and includes any molecule that partially or fully 
blocks, inhibits, or neutralizes a biological activity of a native sequence PRO polypeptide disclosed herein. In 
a similar manner, the term "agonist" is used in the broadest sense and includes any molecule that mimics or 
amplifies a biological activity of a native sequence PRO polypeptide disclosed herein. Suitable agonist or 
antagonist molecules specifically include agonist or antagonist antibodies or antibody fragments, fragments or 
amino acid sequence variants of native PRO polypeptides, peptides, small organic molecules, etc. Methods for 
identifying agonists or antagonists of a PRO polypeptide may comprise contacting a PRO polypeptide with a 
candidate agonist or antagonist molecule and measuring a detectable change in one or more biological 
activities normally associated with the same. 

A "small molecule" is defined herein to have a molecular weight below about 600 daltons. and is 
generally an organic compound. 

A "liposome" is a small vesicle composed of various types of lipids, phospholipids and/or surfactant 
which is useful for delivery of a drug (optionally including a chemotherapeutic agent) to a mammal. The 
components of the liposome are commonly arranged in a bilayer formation, similar to the lipid arrangement of 
biological membranes. 

As used herein, the term "immunoadhesin" designates antibody-like molecules which combine the 
binding specificity of a heterologous protein (an "adhesin") with the effector . functions of immunoglobulin 
constant domains. Structurally, the immunoadhesins comprise a fusion of an amino acid sequence with the 
desired binding specificity which is other than the antigen recognition and binding site of an antibody (*.<>., is 
"heterologous"), and an immunoglobulin constant domain sequence. The adhesin part of an immunoadhesin 
molecule typically is a contiguous amino acid sequence comprising at least the binding site of a receptor or a 
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iigand. The immunoglobulin constant domain sequence in the immunoadhesin may be obtained from any 
immunoglobulin, such as IgG-I, IgG-2, IgG-3, or IgG-4 subtypes, IgA (including IgA-i and IgA-2). I&E, IgD 
orlgM. 
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Table 1 

/* 
* 

* C-C increased from 12 to 15 

* Z is average of EQ 

* B is average of ND 

* match with stop is M: stop-stop = 0: J (joker) match = 0 
V 

^define _M -8 /* value of a match with a stop */ 



int _day[26][26] = { 

/• ABCDEFGHIJKLMNOPQRS. TUVWXYZ*/ 

/• A */ { 2. 0.-2. 0. 0.-4. 1.-1,-1, 0.-1.-2.-1. 0. M. 1. 0.-2. I. I. 0. 0 -6 0 -3 0} 

/* B •/ { 0. 3M. 3. 2.-5. 0. 1.-2. 0. 0.-3.-2. 2.~M.-1, 1. 0. 0. 0. 0,-2 -5. 0-3 1} 

15 /• C */ {-2.-4.-i5.-5.-5.-4.-3.-3.-2. 0.-3.-6.-5.-47 M.-3.-5.-4. 0.-2, 0,-2.-8, 0, 0.-5}. 

/*DV {0,3,-5.4. 3.-6. 1. 1.-2,0,0.-4.-3, 2._M.-I. 2.-1, 0.0, 0,-2 -7 0 -4 2} 

f m E*l {0.2,-5,3.4.-5,0.1.-2.0,0.-3.-2.1. M.-1 . 2.-1. 0, 0, 0 -7 0 -4 3} 

/* F */ {-4,-5.-4.-6.-5. 9.-5.-2. 1. 0.-5. 2. 0,-47m.-5.-5.-4.-3.-3, 0 -10 0 7-5} 

/*G«/ { 1, 0.-3. I. 0.-5. 5.-2.-3. 0,-2.-4.-3. 0.~M.-I.-l.-3. I. 0, 0.-1 -7 0-5 0}' 

20 fttV {-1. 1.-3. 1. .1.-2.-2. 6.-2. 0.0,-2.-2. 2.~M. 0.3. 2.-1.-1. 0.-2.-3. 0 0 2} 

/• I •/ {-1.-2^2.-2.-2, 1.-3,-2, 5. 0,-2. 2. 2.-2,~M.-2.-2.-2,-I. 0. 0. 4.-5 o'.-l'-2} 

/* J */ { 0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0, 0. 0. 0. M. 0. 0. 0. 0. 0. 0 0 0 0 0 0} 

/*K*/ {-I. 0.-5. 0. 0.-5.-2. 0.-2. 0. 5.-3. 0, i7m.-I. 1 . 3. 0. 0. 0.-2 -3 0-4 0} 

/* L •/ {-2.-3.-6.-4.-3. 2.-4.-2. 2. 0.-3, 6. 4.-3.^.-3.-2.-3.-3.-1. 0 2-70-1 -?} 

/• M V {-I. -2.-5.-3.-2. 0.-3.-2. 2. 0. 0, 4. 6.-2. ~M.-2.-I. 0.-2 -1 0 2 -4 0 -2 -1} 

/• N •/ { 0, 2,-4. 2. 1.-4. 0. 2.-2. 0. 1.-3.-2. 2JM.-1. 1. 0. 1. 0. 0.-2.-4. 0.-2. I}. 

/*0 •/ {JA.JA.JA.Jd.JA.JM.JA.JA.JM.jA. M. M. M, M. 0. M. M. M._M. M. M. M. M. M. M. M). 

/•P*/ { 1.-1.-3.-H-1.-5.-1. 0.-2. 0.-l.-3,-2,-l._M.o7o7o. 1.0, 0~.-l."-6. 0,-5* 0}. " 

/*QV {0, 1,-5.2.2.-5.-1.3,-2.0, I.-2.-1. 1. M. 0. 4. 1.-1.-1.0-^-5 0^3} 

/• R •/ {-2. 0.-4.-1. -1.-4.-3. 2.-2, 0. 3,-3. 0. 0."m. 0. 1. 6. 0,-1. 0,-2, 2. 0 -4 0}' 

/•S*/ { 1,0.0.0.0.-3. l.-l.-K 0.0.-3,-2. !.~M. I.-i. 0. 2. 1.0.-1.-2.0,-3 0} 

/•T-/ { 1,0,-2.0.0.-3.0.-1.0.0.0.-1.-1,0. M. 0.-1.-1. 1. 3.0. 0.-5 0 -3 0} 

/• U •/ { 0, 0. 0. 0. 0. 0. 0. 0. 0. 0, 0. 0, 0. 0,_M. 0. 0. 0. 0. 0 0 0 0 0 0 0} 

/* V •/ { 0.-2.-2.-2.-2.-1.-1.-2. 4. 0.-2. 2. 2.-2. M.-1.-2.-2,-!. 0, 0. 4,-6. 0 -2 -2} 

/* W */ {-6.-5.-8.-7.-7. 0.-7,-3.-5. 0.-3.-2.-4.-4._M.-6.-5. 2.-2.-5. 0.-6. 17. 0. 0.-6}. 

/* X •/ { 0, 0. 0. 0. 0, 0. 0. 0. 0. 0. 0, 0, 0. 0. M. 0. 0. 0. 0. 0. 0, 0. 0. 0 0 0} 

/• Y */ {-3.-3. 0.-4.-4. 7.-5. 0.-1. 0.-4.-1.-2.-27 M.-5.-4.-4.-3.-3. 0.-2. 0. 0 10 -4} 

fZV {0.1.-5. 2. 3.-5.0. 2.-2,0.0.-2.-1. 1. *M. 0. 3. 0. 0. 0.0,-2 -6 0 -4 4} 
}: " • • . - J 
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^include <stdio.h> 




^include <crype.h> 




Adeline 


MAXJMP 


16 


^define 


MAXGAP 


24 


^define 


JMPS 


1024 


^define 


MX 


4 


#define 


DMAT 


3 


^define 


DMIS 


0 


//define 


DINSO 


8 


^define 


DINS1 ' 


I 


//define 


PINSO 


8 


^define 


PINS I 


4 


struct jmp { 





/* max jumps in a diag */ 

/♦don't continue to penalize caps larger than this V 
/* max jmps in an path */ 

/* save it' there's at least MX-1 bases since last jmp */ 

/* value of matching bases */ 

/* penalty tor mismatched bases •/ 

/* penalty tor a gap */ 

/* penalty per base •/ 

/* penalty tor a gap */ 

/* penalty per residue */ 



short 

unsigned short 



struct diuu { 
int 



long 
short 
struct jmp 



n(MAXJMP]; 
x(MAXJMP): 



score; 
offset; 
ijmp; 
jp: 



/* size of jmp (neg for dely) */ 
/* base no. of jmp in seq x */ 
/* limits seq to 2* 16 - I V 

/* score at fast jmp */ 
/* offset of prev block */ 
/* current jmp index */ 
/* list of jmps */ 



35 }; 



int 

short 

int 



spc; 



/* number of leading spaces */ 
nfJMPSl:/* size of jmp (gap; */ 
xfJMPS];/* loc of jmp (last eiem before gap) */ 



char 

char 

char 

char 

int 

int 

int 

int 

int 

int 

int 

int 

int 

long 

struct 

struct 

char 
char 



diag 
path 



•ofile; 

•namexf2|; 

*prog: 

•seqx(2|; 

dmax: 

dmaxO: 

dna: 

endgaps: 
gapx. gapy; 
lenO. leni: 
ngapx, ngapy: 
smax; 
•xbtn: 
offset; 
*dx: 
PP(2]; 



1 output file name */ 
1 seq names: getseqsf) •/ 
1 prog name for err msgs */ 
1 seqs: getseqsi ) */ 
best diag: nwi ) ♦/ 
final diag V 
set if dna: main() */ 
set if penalizing end gaps */ 
total gaps in seqs */ 
seq lens */ 
total size of gaps */ 
max score: nw() •/ 
bitmap for matching */ 
current offset in jmp file */ 
holds diagonals */ 
holds path for seqs */ 



♦callocO. *maiioc(), 'indexO. *strcpy(); 
*getseq(). *g_calloc(); 
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I* Needleman-Wunsch alignment program 
* 

* usage: progs file I fi!e2 

5 * where file I and file2 are two dna or two protein sequences. 

* The sequences can be in upper- or lower-case an may contain ambiguity 

* Any lines beginning with ' : \ ' > ' or ' < ' are ignored 

* Max file length is 65535 (limited by unsigned short x in the jmp struct) 

* A sequence with 1/3 or more of its elements ACGTU is assumed to be DNA 
10 * Output is in the file "align. out" 

* The program may create a imp file in Amp to hold info about traceback. 

* Original version developed under BSD 4.3 on a vax 8650 
V 

15 ^include nw.h^ 
^include "dav.h" 



20 



static jJbvalf26J = { 

1,14.2.13.0,0.4,11,0.0.12,0.3.15.0,0,0.5.6.8,8.7.9.0.10,0 



static _pbvalf261 = { 

I. 2 UK <f'D , - , A , ))|(I< <fN , - , A')). 4. 8. 16, 32. 64. 
128. 256. OxFFFFFFF. 1< < 10. 1< < II. !< < 12. 1< < 13. K < 14. 
25 l< < 15. 1 < < 16. K < 17. I < < 18. I < < 19. t < <20. t <*<2I. 1 < <22. 

I < <23. K < 24. 1< <25|(I < <CE , - , A'))|( 1< <rQ'-*A')) 



main(ac. av) 
30 main 

int 2£i 
char *av(J; 

{ 

prog = av(01; 
35 iHac!=3){ 

fjprinttfstderr, "usage: %s ftlel File2\n\ prog); 

fprinitfstderr. "where txlel and file2 are two dna or two protein sequences. \n"): 
fprimtfstderr.The sequences can be in upper- or lower-caseVn"); 
fjprimtfstdcrr. "Any lines beginning with or ' < * are ignored\n"): 
40 fcrimftsiderr. "Output is in the file v"alien.out\"\n"): 

exit(I); 

} 

namexfO] av( I J; 
namexllj = av|2J: 
45 seqx(0) = getseqtnamexiOl, &len0); 

seqx( 1 1 = geiseqtnamexf 1 1. &lenl ); 
xbm = (dna)7 dbvai : j)bval; 

endgaps = 0; /* 1 to penalize endgaps */ 

50 ofile = "align-oui"; /* output file */ 

nw(); /* fill in the matrix, get the possible jmps */ 

readjmpsO; /* get the actual jmps */ 

primO; /* print stats, alignment */ 
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cleanup(O); /* unlink any tmp files */ 



60 



33 



SUBSTITUTE SHEET (RULE 26) 



WO 00/53758 



PCT/US00/05841 



10 



15 



20 



25 



30 



35 



40 



Table 1 (com') 

/* do the alignment, return best score: maini) 

* dna: values in Fitch and Smith. PNAS. 80. 1382-1386, 1983 

* pro: PAM 250 values 

* When scores are equal, we prefer mismatches to any gap, prefer 

* a new gap to extending an ongoing gap, and prefer a gap in seqx 

* to a gap in seq y. 
*/ 

nw() 
{ 

/* seqs and ptrs */ 
/* keep track of dely */ 
/* keep track of delx */ 
/* for swapping rowO. row I */ 
/* score for each type */ 
/* insertion penalties */ 
/* diagonal index */ 
/* jmp index */ 
/* score for curr. last row */ 
/* index into seqs */ 

dx « (struct diag *)g - calloct"to get diags *. IcnO+lenl + 1 . sizeoftstruci diag)): 

ndely = t int *>g_calloc( "to get ndciy \ fen I + 1 . sizeoftint u: 
dely = tint -)g_calioc( "to get dely"*. lenl + 1 , sizeoftint)): 
colO » fini *)g_cailocrto get colO". lenl +1. sizeoftint)): 
coll = (int *)g - callocrto get coll - . lenl +1. sizeof(int»: 
insO = <dna>7 DINS0 : PINS0: 
insl = (dna)? DINS1 : PINS1; 

smax = -10000: 
if (endgaps) { 

ror(col0(0I = dely[0| = -insO. yy - 1: yy < = lenl; yy + + ) { 
colOlyyJ = delyfyy] = colO|yy-l| - insl; 
ndelyfyyj = vy: 

> 

colO(0] = 0: /• Waterman Bull Math Biol 84 V 



char 


*px. *py: 


int 


*ndely, *dely: 


int 


ndeix, delx: 


int 


*tmp: 


int 


mis: 


int 


insO. insl; 


register 


id: 


register 


u; 


register 


*colO, *coll; 


register 


xx. yy; 



} 

else 



forfyy = 1: yy < = lenl: yy-H+) 
delylyyj = -insO: 
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/* fill in match matrix 
•/ 

for (px = seqx(0|. xx = 1: xx < = lenO: px+ + . xx+ +) { 
/* initialize first entry in col 
*/ 

if (endgaps) { 

if (xx « I) 

col I [01 = delx = -(insO+insl): 

else 

coli(0] = delx = col0[0]- insl; 
ndelx = xx: 



} 

else { 



col 1(0] » 0: 
delx = -insO; 
ndelx = 0; 
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fortpy = seqxfl], yy = 1: yy < = !enl; py++. yy++) { 
mis =» colO[yy-l]; 
if(dna) 

mis + = ubm(*px- , A , ]&xbm(*py- , A'I)7 DMAT : DMIS; 

else 

mis += - tiayl*px- , A'][*py- , A'|; 

/* update penalty for del in x seq: 

* favor new del over ongong del 

* ignore MAXGAP if weighting endgaps 

if (endgaps | | ndely|yy| < MAXGAP) { 

if (coi0(yy| - insO > = delylyyj) { 

delylyyj = colOfyyj - (insO+insl): 
ndely(yy] a | ; 

}eise{ 

delylyyj -= insl: 
ndely[yyj+ +: 

} else { 

■ if (col0(yyj - (insO + insh > = dely[yy]) { 
delylyyj = cnlOjyyj - (insU-Hnsl): 
ndely|yy] = l; 

} else 

ndeiy(yy|++; 

/* update penalty tor del in y seq: 

* favor new del over onuong de! 

if (endgaps | | ndeix < MAXGAP) { 

ir(coll|yy-i| - insO > » delx) { 

delx = col i f yy- i | - (insO+insi): 
ndeix = I; 

}else{ 

delx -= insl: 
ndeix+ + ; 

} 

}else{ 

if (col 1 1 yy- 1 1 • tinsO+ insl) > « delx) { 
delx = coll[yy-l j - (insO-Hinsi); 
ndeix =1; 

} else 

ndeix + +; 

} 

/* pick the maximum score: we're favoring 
• mis over any del and delx over dely 
*/ 
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Table I (com 1 ) 
id » xx - yy + leni - I; 

...nw 

if (mis > = delx && mis > =» deiy[yyj) 

collfyy] = mis: 
eise if (deix > = dely(yyj) { 

coilfyy] = delx: 

ij =» dxfidj.ijmp: 

if <dxfidj.jp.nt0) && Hdna 1 1 (ndeix > = MAXJMP 
&&xx > dx(id]Jp.x(ij] + MX) || mis > dx(id].score + DINSO)) { 
dx(idj.ijmp+ + : 
if ( + -Mj > = MAXJMP) { 
wruejmps(id); 
ij =* dx|id].ijmp = 0: 
dxfidj.offset = offset; 

offset + = sizeof(struct jmp) + sizeoffoffset): 

} 

dxfid].jp.n(ij| = ndeix: 
dx(idl.jp.xfij| = xx: 
dx(id|. score = deix: 

} 

else { 

collfyy) = delylyyl: 
ij - dx|id].ijmp: 
if (dx(id|.jp.n|0i (!dna [ j (ndely(yy| > = MAXJMP 

&& xx > dxfidJ.jp.x|ij| + MX) 1 1 mis > dx|id|.score + DINSO)) { 
dxfid!.ijmp-(-+: 
if ( + + MAXJMP) { 

wruejmpstid); 
ij dxfidj.ijmp = 0: 
dxfidj.offset = offset: 

offset += sizeoftstruct jmp; +• sizeof(offset); 

} 

dxfidl.jp.nfijj = -ndelyfyy]: 
dx(id|.jp.xfijj = xx: 
dxfidj.scorc =» dely[yyj; 

if (xx = = lenO && yy < ienl) { 
/•last col 
*/ 

if(endgaps) 

coillyyj .= insO+insl*(lent-yy); 
if (collfyy j > smaxj { 

smax = collfyyj; 
dmax = id; 

} 

} 

} 

if (endgaps xx < lenO) 

coll(yy-l| - = ins0 + insl*(len0-xx); 
if (coilfyy-l] > smax) { 

smax » coil (yy-il; 

dmax = id: 

} 

tmp = colO; coIO = coil; coll = tmp- 

} 

(void) free((char *)ndeiy): 
(void) free<(char *)deiy); 
(void) free((char *)coiO); 
(void) free((char *)coii); 
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/» 

a 

* print() - oniy routine visible outside this module 

0 

* static; 

* getmatO - trace back best path, count matches: print() 

* pr_align() « print alignment of described in array p(]: printO 

* dumpblockO - dump a block of lines with numbers, stars: pr aiignQ 

* numsO - put out a number iine: dumpblockO 

* putlineO -- put out a line (name. fnum|. seq. (num|): dumpblockO 

* starsO - -put a line of stars: dumpblockO 

* stripnamet ) strip any path and prefix from a seqname 
*/ 

^include "nw.h" 
^define SPC 3 

Adeline P_LINE 256 /* maximum output line */ 

^define PJPC 3 /* space between name or num and seq */ 

extern jiay(261[26I; 

in < i»len: /* set output line length */ 

FILE *fx: /* output tile */ 

prinK) 



{ 



print 

int Ix, ly. tirstgap. lastgap; /* overlap */ 

if ((fx = topeitfofile. *w")) = = 0) { 

rprintf(stderr.' , ^s: can't wrire %s\n*\ prog, ofile): 
cleanup( I ); 

} 

fprimf(fx. '< first sequence: %s (length = %d)\n\ namex(0|. lenO): 
rprinifffx. "< second sequence: %s (length = %d)\n\ namexll! lenl)- 
oien = 60; 
lx =* lenO: 
ly * lenl; 

tirstgap = lastgap - 0: 

if (dmax < lenl - 1) { /* leading gap in x */ 
pp(0I.spc = tirstgap = lent - dmax - 1; 
ly pptOJ.spc: 

} 

else if (dmax > lenl - I) { /* leading gap in y */ 
pp(l).spc =» firstgap = dmax - (lenl - I); 
Ix — pplU.spc: 

} 

if (dmaxO < lenO - I) { /* trailing gap in x */ 
lastgap » Ien0-dmax0-l: 
lx-=» lastgap; 

} 

else if (dmaxO > lenO - 1) { /* trailing gap in y */ 
lastgap = dmaxO - (lenO - I); 
ly-= lastgap; 

} 

getmatflx, ly, tirstgap, lastgap); 
pr.aJignO; 
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/* 

* trace back the best path, count matches 
*/ 

static 

getmatdx. Iy. tirstgap. lastgap) 



getmat 




/* "core" (minus endgaps) */ 
/* leading trailing overlap */ 



int first gap, Jastgap: / 



int nm. iO. il. sizO. sizl; 



char outx(32]; 
double pet: 



register nO.nl; 
register char *pO. *pl: 



/* get total matches, score 
V 

iO = ii = sizO = sizl = 0: 
pO = seqx|0] + pptlj-spc: 
pi = seqxdl + pp(0].spc: 
nO = pp(l].spc + I: 
nl = pp(0|.spc + I: 

nm = 0: 

while < *pO&& *pl ) { 
if(sizO) { 

pH- + : 
nl + -t-; 
sizO--; 

} 

elseif (sizl) { 

pO++: 
nO+ + : 
sizl--: 

} 

else { 

if (xbm| *pO- ' A ' I<Stxbm| *p I - ' A ' ] ) 



if (n0+ + == pp(0|.xfiO|) 

sizO = pp|O|.nfi0+ + ]; 

if (nl + - -= pp(l|.x|il|) 

sizl « pp|l|.n|ii + +|: 

p0+ + ; 
pi + + : 



} 
} 

/* pet homology: 

* if penalizing endgaps. base is the shorter seq 

* else, knock off overhangs and take shorter core 
*/ 

if (endgaps) 

Ix = (lenO < leal)? lenO : lenl; 

else 

Ix a (lx < Iy)? Ix : ly; 
pet » 100.*(doubie)nm/(double)!x; 
fprintflfx. "\n"); 

fprintf(fe, - < %d matches in an overlap of %d: %.2f percent sinularity\n\ 
nm, (nm =:= t)? - : " es \ lx. pet); 



nm+ + : 
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Table 1 (conn 
rprintfffx, " <gaps in first sequence: %d", gapx); 
if (gapx) { 

(void) sprinttfoutx. * (%d %s%s)\ 

ngapx, (dna)7 "base": "residue", (ngapx = = 1)7 "Vs"); 
tprintf(fx. ,, %s\ outx); 

tprintfffx. ". gaps in second sequence: %d m . gapy); 

(void) sprinttifoutx, - (%d %$%s)\ 

ngapy. (dna)? "base": "residue", (ngapy I)? V); 
rprintf(fx."%s\ outx); 



...getmat 



} 

if (dna) 



else 



rpriruf(fx. 

"\n<score: %d (match =* %d. mismatch = %d. gap penaltv = %d + %d per base An" 
smax, DM AT. DMIS. DINSO. DINS I): 

fprintfffx, 

"\n<score: %d (Dayhoff PAM 250 mairix. gap penalty = %d + %d per residue)\n" 
smax. PINSO. PINS1): 



if (endgaps) 

fprintttfx. 



else 



} 



"<endgaps penalized, left endgap: %d Zs%s, right endgap: %d %s%s\n". 
tirstgap. tdna)7 "base" : "residue", (iirstgap ==!)? ■ "s". 
last gap, i dna)? "base" : "residue", (lastgap == 1)7 : "s"): 

fprintt'ffx. " < endgaps not penalized\n"): 



ststic «n: /* matches in core - fur checking */ 

static Imax: /* lengths or stripped tile names */ 

statk . ij{2J; /* jmp index tor a path */ 

static nc(2]; /* number at start of current line */ 

slatic ni(2J; /* current elem number » for gapping •/ 

static siz(2); 

static char *ps(2]; /* ptr to current element ♦/ 

static char *po[21: /* ptr to next output char slot */ 

static char out(2I[P_LINEl; /* output line */ 

static char star(P_LINE]: •* set by starso */ 

/• 

* print alignment of described in struct path pp(] 
*/ 

static 

pr.align(> p raligI1 

int nn; /* char count */ 

int more: 
register i; 

for (i = 0. Imax = 0; i < 2: i + +) { 
nn = siripnametnamex(i]); 
if (nn > Imax) 

Imax = nn: 

nc(ij =» I: 
ai[il = I: 
siz[fl = ijfl] = 0; 
ps(i] = seqx(i]; 
pofij = outfi]; 
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for (nn = nm = (X more - t : more: ) { _ pr 
for(i = more =* 0; i < 2: i + +) { 
/• 

5 * do we have more of this sequence? 

*/ 

if(!*psfi]) 

continue: 

to more++; 

if (pp(i).spc) { /* leading space */ 
*po(i}++ = ' '; 
pp(i].spc--: 

15 } 

else if (sizfi]) { /* in. a gap */ 
*po(i}++ = 
sizfil— : 

} 

20 eise{ /* we're putting a seq element 

♦/ 

*po(H = *ps(i|; 
if (islowerrpsfi})) 

*ps(i| * iouppen*ps|il): 

25 po(j) + + ; 

ps(i|+ + : 

/* 

• are we at next gap for this seq? 
30 */ 

if(ni[ii ==pp{iUij{ij]){ 
/* 

* we need to merge aJI gaps 

* at this location 

35 

siz(i] = PPlil-nfij[i] + + l: 
while (ni(t| == pp(i].x(ij[i]]) 

sizfi] +» ppli).n[ijlij+ + |: 

40 ni[il++: 

} 

} 

if < + +nn = = olen 1 1 !more &.&. nn> { 
dumpblockO: 

45 for(t =» 0; t < 2; i + +) 

po(i| » outfil: 

nn =» 0: 

} 

} 

50 } 
/* 



* dump a block of lines, including numbers, stars: pr alignO 
*/ 

55 static 

dumpblockO diimpbiock 
register i; 

60 for(i = 0; i < 2: i++) 

*pofi]- « AO'; 
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10 



15 



(void) putc('\n'. fx): 
for(i = 0: i < 2; i + +) { 

if(*out[i|<St&(*ourfi] != • ' || *(po[ij) ! = 
if (i == 0) 

nums(i); 
if (i o&& *omfl]) 
stars(); 

pudine(i): 

if (i 0&& *out(I|) 

fprimftfx. siar); 
if (i " I) 

nums(i): 



} 



')){ 



} 



..dmnpbiodt 
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30 



35 



40 



45^ 
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/* . 

* put out a number line: dumpblock() 
*/ 

static 
nums(ix) 

int ix: 



/* index in oui|| holding seq line V 



char nline[P_LINE]: 

register i, j; 

register char *pn. *px, *py; 

for(pn = niine. i « 0; i < Imax + P SPC: i + + . pn++) 
*pn * ' 

for(i = nc[ixl, py =» outfixl: *py; py+ +. pn + + ) { 
if(*py " ' ' I| *py == 
*pn = "; 

else { 

if(i%10 ==0 11(1==! &&nc[ixj !« I)) { 
j = (i < 0)7 -i : i; 
for(px a pn: j: j / = 10. px-) 
*px = j%10 + *0': 

if (i < 0) 

*px - 

} 



} 



else 

i + + ; 



*pn 



} 

*pn = \0': 
nc(ix] - i; 

for(pn « nline; *pn; pm- + ) 
(void) putc(*pn. fx): 
(void) putc('\n\ fx); 



/* 

* put out a line (name, [num], seq, [num|): dumpblock() 

static 

pudine(ix) 



int 



{ 



ix; 



putline 
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int 

register char 




for(px = namexfixl. i = 0: *px && * px != V; px + + , 

(void) putc(*px. fx); 
for(; i < lmax+P_SPC: i + +) 

(void) putcC '. fx): 

/* these count from I : 

* nif] is current element U'rom 1) 

* nc() is number at sun of current line 
7 

for (px = out(ix); *px: px + + ) 

(void) putc(*px&0x7F. fx); 
(void) putc('\n\ fx): 

} 



•^put a line of stars (seqs always in nut|0|. out| 1 |>: durnpblockO 

static 

starsO 

stars 

{ 

int i; 

register char *p0. *p I . ex. *px; 

. if (!*oui|0| 1 1 (*out(0| 7po|0|) = = ■ ') 1 1 

!*out(l) || (*om(l) = *(poft|) == • •)) 



px = siar: 

for(i - Imax + PSPC; t: i~) 
*px + + = ' *; 

for(pO = out[01. pi » outlll: *p0&& *pl: p0++. pi++) { 
if (isaipha(*pO) &.& isalpha(*p!)) { 

if (xbml'pO-'A'I&xbrnl'pI-'A*]) { 
cx = 
nm+ + ; 

} 

else if Hdna && ^dayCpO-'A'H'pl-'A'l > 0) 
cx = '.*; 

else 

cx = ' '; 

} 

else 

cx - J '; 
*px++ = cx: 

} 

*px+ + = '\n': 
*px = 'W; 
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Table 1 (conV) 

/* 

* strip path or prefix from pn. return len: pr alignf) 
•/ 

static 

stripname(pn) 

^ char *pn: /* file name (may be path) */ 

register char *px, *py; 
py = 0: 

for (px = pn; *px: px+ +) 
if('px " V) 

py » px + l: 

if(py)- 

(void) strcpy(pn, py); 
returnfstrten(pn)); 

} 



PCT/US0O/05841 



stripoame 
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Table 1 (com') 

/* ~ 

* cleanupO - cleanup any rnip fUe 

* getseqO - read in seq. set dna. len. maxlen 

* g_calloc() « calloc() with error checkin 

* readjmpsO - get the good jmps. from tmp tile if necessary 

* writejmpsO — write a tilled array of jmps to a tmp file: nwi > 
•/ 

^include "nw.h" 
^include <sys/file.h> 

char *jname « Vtmp/homgXXXXXX*; /* anp tile for imps */ 

RLE *fj; 

int cleanupO: /* cleanup tmp file */ 

long IseekQ; 

/* 

* remove any tmp file if we blow 
V 

cleanup^) cleanup 
int i: 

{ 

if(fj) 
exit(i): 

} 



(void) uniink(jname): 



/* 

* read, return ptr to seq. set dna. len. maxlen 

* skip lines starting with ':'.'<' f or ' > * 

* seq in upper or lower case 
V 

char * 

getseqffile. len) getseq 
cbar 'file; /* file name V 



{ 



int Men: /* seq len */ 

char linef 1024], *pseq; 

register char *px. *py; 

int natgc. tlen: 

FILE *fp; 

if ({fp = (open* file. V)) « 0) { 

fprintftstderr/^s: can't read %s\n\ prog, file); 
exit(l); 

} 

tlen = natgc ~ 0: 

while <fgets<line. 1024. hp)) { 

if (*Iine ==» ':' || 'line = = " < ' || *line = =* ' > ') 

continue: 
for(px ~ line; *p.x !=* \n'; px++) 

if (isupperi*px) || islower(*px)) 
tien+ + : 

} 

if ((pseq = maiIoc((unsignedXtlen+6))) =»=» 0) { 

n>rintf(stderr."%s: maliocO failed to get %d bvtes for %s\n", prog, tlen+6. file); 
exitfl); 

} 

pseqfO] = pseqjl] = pseq(2] = pseq(31 » '\0'; 



44 



SUBSTITUTE SHEET (RULE 26) 



WO 00/53758 



PCT/US00/0S841 



} 



Table 1 (com') 

py =» pseq + 4; 
*!en = lien: 
rewind(fp); 

while (fgetsdine. 1024. fp)) { 

if(*!ine == ':' || *line = = ' <' 1 1 +tine == *>') 

continue: 
for (px = line: *px ! = \n*: px+ +) { 
if (isupper(*px)) 

*py+ + = *px; 
else if (islower* *px» 

*py + + = toupper(*px): 
if (indexCATGCUV(py-l))) 
natgc + + : 

} 

} 

*py++ = '\0\ 
*py = 'W: 
(void) fclose(fp): 
dna = natec > ftlen/3); 
return! pseq ■+• 4); 



char * 

g_cailoc(msc. nx. sz) 

char *msg: 
int 



{ 



char 



ax. sz; 



/* program, calling routine */ 
/* number and size ot elements */ 



*px. *calIoc<); 



g calloc 



if ((px = cailocf (unsigned )nx. (unstgncd)sz)) = = 0) { 
if (*msg; { 

fprintf(stderr. "%y. g_callocO failed %s (n=%d. sz=%d)\n". prog. msg. nx. sz): 
exit(l); 

} 

} 

return* px): 



/* 

• get final jmps trom dx| | or tmp file, set pp| J. reset dmax: maim ) 
V 

readjmps() 
{ 

int fd = -l: 

int siz. iO. ii; 

register i. j. «; 



readjmps 



} 



(void) fciose(fj): 

if ((fd =» open(jname. 0_RDONLY. 0)) < 0) { 

fprintfCstderr. *%s: can t open() %s\n\ prog, jname): 
cieanup(t): 

} 



for (i » iO = il = 0, dmaxO = dmax. xx = !en0; * i + +) { 
while ( I) { 

for (j = dx(dmax|.ijmp: j > = 0 &&. dx(dmax|.jp.x[j] >=«; j-) 
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Table 1 (conn 

if (j < 0 && dx(dmax|.offset &&. tj) { ..-readjmps 
(void) IseeWfd. dx|dmax|.offse[. 0); 
(void) readttd. (char *)&dx|dmax|.jp. sizeoffsiruct jmp)); 
(void) readffd, (char *)&dx(dmax (.offset. sizeoffdxfdmax [.offset)); 
dxfdmaxl.ijmp = MAXJMP-I; 

else 

break: 

} 

if (i > = JMPS) { 

fprimf(siderr. too many gaps in alignments \ prog): 
cleanup(l): 

. ) 

if(j > * 0) { 

siz = dxfdmax|.jp.n{jj; 
xx =» dxfdmax|.jp.x(j]; 
dmax + = siz; 

if (siz < 0) { /• ga p in second seq v 

pp(l].nfil| = -siz; 
xx + = siz: 

/* id = xx - yy + | C n! - ! 
*/ 

PPlU.x|ii) = xx - dmax + lenl - i; 
gapy + + ; 
ngapy -= siz: 
/* ignore MAXGAP when doing endgaps */ 

siz = (-siz < MAXGAP 1 1 endgaps)? -siz : MAXGAP: 
+ + ; 

} 

else if (siz > 0) { /* gap m first seq */ 
pp(OI.nfi0| = siz; 
pplOl-xfiO] = xx: 
gapx+ + : 
ngapx + = siz; 
/* ignore MAXGAP when doing endgaps */ 

siz = (siz < MAXGAP 1 1 endgaps)? siz : MAXGAP; 
i0+ + : 

} 

} 

else 

break: 

} 

/* reverse the order of jmps 
*/ 

for (j - 0. i0-; j < iO; j-f- + . i0--) { 

i - PPlOJ.n0]: PP(0j.nUl = pp(0).n[i0|; pp[0].n(i01 = r 
^ i = pp(0J.x(j]; pp(0].x(jj a pp[0].x[i0]; ppl0].x[i0] = i; 

for(j = 0, il-; j < il; j + + . il-) { 

' « PPfl]-n(j]: pp(l].nfj] - pp(l].nfil]; pp[I].n[il] = i; 
^ « - PP(l].x{j]; pp(IJ.x(j] . pplll.xfil]; pp[l].xfiil - i; 

if (fd > = 0) 

(void) close(fd); 

(void) unltnkXjname); 

offcet =» 0; 

} 
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Table 1 (com') 

/• 

* write a filled jmp struct offset of the prev one (if anv)* nw() 
*/ 

writejmps(ix) writejmps 
int ix; 

{ 

char "mktempO; 
if(!Q){ 



if (mJctempijname) < 0) { 

fprimf(stden\ "7 Q s: cam mktempO %s\n". prog, jname): 
cleanup(l): 

15 } 

if ((fj = fopen(jname. "w")) =»» 0) { 

fprintf(stden\ "%s: can't write %s\n\ prog, jtume): 
exit(l): 

} 

20 } 

(void) fwriteUchar *)&dx[ixj.jp. sizeotfstruct jmp). I. fj); 
^ (void) rwritetuhar *)&dx|ix|. offset. sucof(dx(ix|. offset). V (j); 

25 
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Table 2 

PRO XXXXXXXXXXXXXXX (Length = 15 amino acids) 

5 Comparison Protein XXXXXYYYYYYY (Length = 12 amino acids) 

% amino acid sequence identity » 

(the number of identically matching amino acid residues between the rwo polypeptide sequences as determined by 
1 0 ALIGN-2) divided by (the total number of amino acid residues of the PRO polypeptide) = 

5 divided by 15 = 33.3% 
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Table 3 



PR0 XXXXXXXXXX (Length = 10 amino acids) 

Comparison Protein XXXXXYYYYYYZZYZ (Length = 15 amino acids) 

5 

% amino acid sequence identity = 

(the number of identically matching amino acid residues between the two polypeptide sequences as determined by 
ALIGN-2) divided by (the total number of amino acid residues ot" the PRO polypeptide) = 

10 

5 divided bv 10 = 50% 
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Table 4 



PRO-DNA 
Comparison DNA 



NNNNNNNNNNNNNN 
NNNNNNLLLLLLLLLL 



(Length = 14 nucleotides) 
(Length = 16 nucleotides) 



% nucleic acid sequence identity = 



(the number of identically matching nucleotides between the two nucleic acid sequences as determined by ALIGN-2) 
divided by (the total number of nucleotides of the PRO-DNA nucleic acid sequence) * 

10 

6 divided by 14 = 42.9% 
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Table 5 

PRO-DNA NNNNNNNNNNNN (Length - 12 nucieotides) 

Comparison DNA NNNNLLLVV (Lcnglh . 9 nucleotides) 

5 

% nucleic acid sequence identity = 

(the number of identically matching nucleotides between the two nucleic acid sequences as determined by ALIGN-2) 
divided by (the total number of nucleotides of the PRO-DNA nucleic acid sequence) = 

10 

4 divided by 12 = 33.3% 
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II. Compositions and Methods of the Invention 

1- Preparation of the PRO polypeptides of the invention 

The present invention provides newly identified and isolated nucleotide sequences encoding the 
polypeptides in the present application as PRO polypeptides. In particular. cDNAs encoding various PRO 
polypeptides have been identified and isolated, as disclosed in further detail in the Examples below. It is noted 
that proteins produced in separete expression rounds may be given different PRO numbers but the UNQ 
number is unique for any given DNA and the encoded protein, and will not be changed. However, for the sake 
of simplicity, in the present specification the protein encoded by the full length native nucleic acid molecules 
disclosed herein as well as all further native homologues and variants included in the foregoing definition of 
PRO. will be referred to as "PRO/number" or even "PRO", regardless of their origin or mode of preparation. 

In particular. cDNA encoding a PRO200. PRO204. PR0212. PR0216, PR0226. PRO240. PR0235, 
PR0245. PR0172. PR0273. PR0272. PR0332. PR0526. PRO701. PR0361. PR0362, PR0363. PR0364. 
PR0356. PR0531. PR0533. PRO1083. PR0865. PRO770. PR0769. PR0788. PROUI4. PRO1007. 
PROI 184. PROI031. PR01346. PR01155. PRO1250. PR01312. PR0U92. PROJ246. PR01283. PROl 195. 
PR01343. PR01418. PR01387. PROI4I0. PR019I7. PR01868. PRO205. PR021. PR0269. PR0344. 
PR0333. PR038I. PRO720. PR0866. PRO840. PR0982. PR0836. PROI 159, PR01358. PR01325. 
PR0133S. PR01434. PR04333. PRO4302. PRO4430 and PR05727 polypeptide (corresponding to UNQ174. 
UNQ178. UNQI86. UNQ190, UNQ200. UNQ214, UNQ209. UNQ219. UNQ146. UNQ240. UNQ239. 
UNQ293. UNQ330. UNQ365. UNQ316. UNQ3I7, UNQ318. UNQ319, UNQ313. UNQ332. UNQ334. 
UNQ540. UNQ434. UNQ408. UNQ407, UNQ430, UNQ557. UNQ49I, UNQ598. UN0516. UNQ70I. 
UNQ585. UN0633. UNQ678. UNQ606. UNQ630. UNQ653. UNQ608, UNQ698, UNQ732. UNQ722, 
UNQ728. UNQ900. UNQ859, UNQI79. UNQ21, UNQ236. UNQ303. UNQ294. UNQ322. UNQ388. 
UNQ435. UN0433, UNQ483, UNQ545. UNQ589. UNQ707. UNQ685, UNQ693. UNQ739. UNQ1888, 
UNQI866. UNQ1947 and UNQ2448. respectively) has been identified and isolated, as disclosed in further 
detail in the Examples below. 

In even greater particularity, the present specification describes the cDNAs DNA29I0I-1276. 
DNA3037I-U57. DNA30942-1 134. DNA33087-1 158. DNA33460-1 166. DNA34387-1 138, DNA35553- 
1167. DNA35638-1I41, DNA359 16-1 161, DNA39523-1 192. DNA40620-1 183. DNA40982-I235. 
DNA44184-I319. DNA44205-I285. DNA454I0-1250, DNA45416-1251, DNA45419-1252. DNA47365- 
1206. DNA47470-1I30. DNA48314-I320. DNA49435-1219, DNA5092 1-1458, DNA53974-1401. 
DNA54228-1366, DNA5423 1-1366, DNA56405- 1357, DNA57033- 1403, DNA57690- 1374, DNA59220- 
1514. DNA59294-1381. DNA59776-1600, DNA59849-1504, DNA60775-1532, DNA6I873-1574, 
DNA628I4-1521. DNA64885-1529. DNA65404-1551, DNA65412-1523, DNA66675-1587. DNA68864- 
1629, DNA68872-1620, DNA68874-1622, DNA76400-252S, DNA77624-2515. DNA30868-1 156, 
DNA36638-1056, DNA38260-1 180, DNA40592-1242. DNA41374-1312, DNA44194-13I7, DNA53517- 
1366, DNA53971-1359, DNA53987-1438, DNA57700-1408, DNA59620-1463, DNA60627-1508, 
DNA64890-1612, DNA66659-1593, DNA66667-1596, DNA688 1 8-2536, DNA842 10-2576, DNA92218- 
2554, DNA96878-2626, DNA98853-1739 which encode native sequence PRO200, PRO204, PR0212, 
PR0216, PR0226, PRO240, PR0235, PR0245, PR0172, PR0273, PR0272, PR0332, PR0526, PRO701, 
PR0361, PR0362, PR0363. PR0364, PR0356, PR0531, PR0533, PRO1083, PR0865, PRO770, PR0769, 
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PR0788, PROM 14, PRO1007. PROI184. PRO103L PR01346, PR01I55. PRO1250. PR01312, PR01192. 
PR01246. PR01283, PROU95. PR01343, PR01418. PR01387, PRO1410. PR01917. PR01868. PRO205, 
PR021, PR0269, PR0344. PR0333. PR038L PRO720, PR0866, PRO840, PR0982, PR0836, PR01159, 
PR01358, PR01325, PROI338. PR01434, PR04333. PRO4302, PRO4430 and PR05727 polypeptides, 
respecriveiy. 

As disclosed in the Examples below, various cDNA clones have been deposited with the ATCC. The 
actual nucleotide sequence of those clones can readily be determined by the skilled artisan by sequencing of the 
deposited clone using routine meihods in the an. It is understood that the sequence of the deposit contains the 
correct sequence in the event of a discrepancy between the deposited sequence and those disclosed herein. The 
predicted amino acid sequence can be determined from the nucleotide sequence using routine skill. For the 
PRO polypeptides and encoding nucleic acids described herein. Applicants have identified what is believed to 
be the reading frame best identifiable with the sequence information available at the rime. 

B. PRO Polypeptide Variants 

In addition to the full-length native sequence PRO polypeptides described herein, it is contemplated 
that PRO variants can be prepared. PRO variants can be prepared by introducing appropriate nucleotide 
changes into the PRO DN A. and/or by synthesis of the desired PRO polypeptide. Those skilled in the an will 
appreciate that amino acid changes may alter post- translation^ processes of the PRO. such as changing the 
number or position of glycosylation sites or altering the membrane anchoring characteristics. 

Variations in the native full-length PRO sequence or in various domains of the PRO described herein, 
can be made, for example, using any of the techniques and guidelines for conservative and non-conservative 
mutations set forth, for instance, in U.S. Patent No. 5 ,364,934. Variations may be a substitution, deletion or 
insertion of one or more codons encoding the PRO that results in a change in the amino acid sequence of the 
PRO as compared with the native sequence PRO. Optionally the variation is by substitution of at least one 
amino acid with any other amino acid in one or more of the domains of the PRO. Guidance in determining 
which amino acid residue may be inserted, substituted or deleted without adversely affecting the desired 
activity may be found by comparing the sequence of the PRO with that of homologous known protein 
molecules and minimizing the number of amino acid sequence changes made in regions of high homology. 
Amino acid substitutions can be the result of replacing one ammo acid with another amino acid having similar 
structural and/or chemical properties, such as the replacement of a leucine with a serine, i.e.. conservative 
amino acid replacements. Insertions or deletions may optionally be in the range of about 1 to 5 amino acids. 
The variation allowed may be determined by systematically making insertions, deletions or substitutions of 
amino acids in the sequence and testing the resulting variants for activity exhibited by the full-length or mature 
native sequence. 

PRO polypeptide fragments are provided herein. Such fragments may be truncated at the N-terminus 
or C-terminus, or may lack internal residues, for example, when compared with a full length native protein. 
Certain fragments lack amino acid residues that are not essential for a desired biological activity of the PRO 
polypeptide. 

PRO fragments may be prepared by any of a number of conventional techniques. Desired peptide 
fragments may be chemically synthesized. An alternative approach involves generating PRO fragments by 
enzymatic digestion, e.g., by treating the protein with an enzyme known to cleave proteins at sites defined by 
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particular amino acid residues, or by digesting the DNA with suitable restnction enzymes and isolating the 
desired fragment. Yet another suitable technique involves isolating and amplifying a DNA fragment encoding 
a desired polypeptide fragment, by polymerase chain reaction (PGR). Oligonucleotides that define the desired 
termini of the DNA fragment are employed at the 5* and 3' primers in the PCR. Preferably, PRO polypeptide 
fragments share at least one biological and/or immunological activity with the native PRO polypeptide 
disclosed herein. 

in particular embodiments, conservative substitutions of interest are shown in Table 6 under the 
heading of preferred substitutions. If such substitutions result in a change in biological activity, then more 
substantial changes, denominated exemplary substitutions in Table 6. or as runner described below in reference 
to amino acid classes, are introduced and the products screened. 

Table 6 

Exemplary 



Original 
Residue 



Substitutions 



Preferred 
Substitutions 



Ala (A) 
Arg(R) 
Asn (N) 
Asp (D) 
Cys (C) 
Gin (0) 
GIu (E) 
Gly (G) 
His (H) 
He (I) 

Leu (L) 

Lys (K) 

Met (M) 

Phe (F) 

Pro(P) 

Ser(S) 

Thr(T) 

Trp(W) 

Tyr(Y) 

Val(V) 



val; leu; ile 

lys; gin; asn 

gin; his: lys; arg 

giu 

ser 

asn 

asp 

pro; ala 

asn; gin; lys; arg 

leu; val; met; ala: phe; 

norieucine 

norleucinc; ile; val: 

met: ala: phe 

arg; gin; asn 

leu; phe; ile 

leu; val; ile; ala; tyr 

ala 

thr 

ser 

tyr, phe 

trp; phe; thr; ser 
ile; leu; met; phe; 
ala; norieucine 



val 
lys 
gin 
glu 
ser 
asn 
asp 
ala 
arg 

leu 

ile 
arg 
leu 
leu 
ala 
thr 
ser 
tyr 
phe 

leu 



Substantial modifications in function or immunological identity of the invention polypeptide are 
accomplished by selecting substitutions that differ significantly in their effect on maintaining (a) the structure 
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of the polypeptide backbone in the area or' the substitution, for example, as a sheet or helical conformation, (b) 
the charge or hydrophobiciry of the molecule at the target site, or (c) the bulk of the side chain. Naturally 
occurring residues are divided into groups based on common side-chain properties: 

( 1 ) hydrophobic: norleucine. met, ala. vaJ. leu, ile; 

(2) neutral hydrophiiic: cys, ser, thr; 

(3) acidic: asp, giu; 

(4) basic: asn. gin, his. lys, arg; 

(5) residues that influence chain orientation: gly, pro: and 

(6) aromatic: trp, tyr, phe. 

Non-conservative substitutions will entail exchanging a member of one of these classes for another 
class. Such substituted residues also may be introduced into the conservative substitution sites or. more 
preferably, into the remaining (non-conserved) sites. 

The variations can be made using methods known in the art such as oiigonucieotide-mediated (site- 
directed) mutagenesis, alanine scanning, and PCR mutagenesis. Site-directed mutagenesis [Carter et uL NucL 
Acids Res.. L3:433 1 f 1986): Zoller et ai. NucL Acids Res., U):6487 (1987)]. cassette mutagenesis (Weils et aL. 
Gene. 34:315 (1985)], restriction selection mutagenesis (Wells et aL. Philos. Trans. R. Soc. London SerA. 
3T7:4I5 (1986)| or other known techniques can be performed on the cloned DNA to produce the PRO variant 
variant DNA. 

Scanning ammo acid analysis can also be employed to identify one or more amino acids along a 
contiguous sequence. Among the preferred scanning amino acids are relatively small, neutral amino acids. 
Such ammo acids include alanine, glycine, serine, and cysteine. Alanine is typically a preferred scanning 
amino acid among this group because it eliminates the side-chain beyond the beta-carbon and is less likely to 
alter the main-chain conformation of the variant [Cunningham and Wells. Science. 244: 1081-1085 (1989)J. 
Alanine is also typically preferred because it is the most common amino acid. Further, it is frequently found in 
both buried and exposed positions (Creighton. The Proteins. (W.H. Freeman & Co.. N.Y.); Chothia. J. Mo/. 
BioL. 150: 1 (1976)]. If alanine substitution does not yield adequate amounts of variant, an isotcric amino acid 
can be used. 

C. Modifications of PRO 

Covalent modifications of PRO polypeptides are included within the scope of this invention. One 
type of covalent modification includes reacting targeted amino acid residues of a PRO polypeptide with an 
organic derivatizing agent that is capable of reacting with selected side chains or the N- or C- terminal residues 
of the PRO. Derivatization with Afunctional agents is useful, for instance, for crossiinking PRO to a water- 
insoluble support matrix or surface for use in the method for purifying anti-PRO antibodies, and vice-versa. 
Commonly used crossiinking agents include, e.g., i,I-bis<diazoacetyl)-2-phenylethane. giutaraldehyde, N- 
hydroxysuccinimide esters, for example, esters with 4-azidosalicylic acid, homobtfunctionai imidoesters, 
including disuccinimidyl esters such as 3 J f -dithiobis(succinimidyiproptonate), Afunctional maleimides such as 
bis-N-maieimido- 1,8 -octane and agents such as memyl-3-[(p-azidophenyl)ditmo]propioimidate. 

Other modifications include deamidation of glutaminyl and asparaginyi residues to the corresponding 
glutamyl and asparryi residues, respectively, hydroxylation of proline and lysine, phosphorylation of hydroxyl 
groups of seryl or threonyl residues, methylation of the a-amino groups of lysine, arginine, and histidine side 
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chains fT.E. Creighton. Proteins: Structure and Molecular Properties. VV.H. Freeman & Co.. San Francisco, 
pp. 79-86 (1983)], aceryiation of the N-terminai amine, and amidation of any C-terminai carboxyi group. 

Another type of covalent modification of the PRO polypeptide included within the scope of this 
invention comprises altering the native glycosyiation pattern of the polypeptide. "Altering the native 
glycosylation pattern" is intended for purposes herein to mean deleting one or more carbohydrate moieties 
found in native sequence PRO polypeptide (either by removing the underlying glycosyiation site or by deleting 
the glycosyiation by chemical and/or enzymatic means), and/or addingone or more glycosyiation sites that are 
not present in the native sequence PRO. In addition, the phrase includes qualitative changes in the 
glycosyiation of the native proteins, involving a change in the nature and proportions of the various 
carbohydrate moieties present. 

Addition of glycosyiation sites to the PRO polypeptide may be accomplished by altering the amino 
acid sequence. The alteration may be made, for example, by the addition of. or substitution by, one or more 
serine or threonine residues to the native sequence PRO (for O-linked glycosyiation sites). The PRO amino 
acid sequence may optionally be altered through changes at the DNA level, particularly by mutating the DNA 
encoding the PRO polypeptide at preselected bases such that codons are generated that will translate into the 
desired amino acids. 

Another means of increasing the number of carbohydrate moieties on the PRO polypeptide is by 
chemical or enzymatic coupling of glycosides to the polypeptide. Such methods are described in the art, e.g.. 
in WO 87/05330 published i I September 1987, and in Aplin and Wriston. CRC Crit. Rev. Biochem. % pp. 259- 
306(1981). 

Removal of carbohydrate moieties present on the PRO polypeptide may be accomplished chemically 
or enzymaticaily or by mutational substitution of codons encoding for amino acid residues that serve as targets 
for glycosyiation. Chemical degiycosylation techniques are known in the an and described, for instance, by 
Hakimuddin. et aL Arch. Biochem. Biophys., 259:52 H987) and by Edge ct aL Anal Biochem.. 118:131 
( 198 1 ). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the use of a variety 
of endo- and exo-glycosidases as described by "Thotakura at aL Meth. EnzymoL 138:350 f 1987). 

Another type of covalent modification of PRO comprises linking the PRO polypeptide to one of a 
variety of nonproteinaceous polymers, t \g.. polyethylene glycol (PEG), polypropylene glycol, or 
polyoxyalkylenes. in the manner set forth in U.S. Patent Mos. 4,640.835; 4,496,689; 4.301.144; 4,670,417; 
4,791,192 or 4,179,337. 

The PRO polypeptides may also be modified in a way to form a chimeric molecule comprising the 
invention polypeptide fused to another, heterologous polypeptide or ammo acid sequence. 

In one embodiment, such a chimeric molecule comprises a fusion of the PRO with a tag polypeptide 
which provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is generally 
placed at the amino- or carboxyi- terminus of the PRO. The presence of such epitope- tagged forms of the PRO 
polypeptide can be detected using an antibody against the tag polypeptide. Also, provision of the epitope tag 
enables the PRO to be readily purified by affinity purification using an anti-tag antibody or another type of 
affinity matrix that binds to the epitope tag. Various tag polypeptides and their respective antibodies are well 
known in the art Examples include poly-histidine (poly-his) or poly-rustidine-glycine (poiy-his-giy) tags; the 
flu HA tag polypeptide and its antibody 12CA5 [Field et aL MoL Cell BioL, 8:2159-2165 (1988)]; the c-myc 
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tag and the 8F9. 3C7. 6E10, G4. B7 and 9E10 antibodies thereto [Evan et aL Molecular and Cellular Biology, 
5:3610-3616 (1985)]; and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody [Pafaorsky et at.. 
Protein Engineering, 3(6):547-553 (1990)]. Other tag polypeptides include the Flag-peptide [Hopp et aL 
Biotechnology, 6; 1204-1210 (1988)];' the fCT3 epitope peptide [Martin et aL Science, 255:192-194 (1992)]; an 
a-rubuiin epitope peptide [Skinner et aL J. Biol. Chem. t 266:15163-15166 (1991)]; and the T7 gene 10 
protein peptide tag [Lutz-Freyerrnuth et aL. Proc. Natl. Acad. ScL USA. 87:6393-6397,(1990)]. 

In an alternative embodiment, the chimeric molecule may comprise a fusion of the PRO polypeptide 
with an immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the chimeric 
molecule (also referred to as an "immunoadhesin"). such a fusion could be to the Fc region of an IgG molecule; 
The Ig fusions preferably include the substitution of a soluble (transmembrane domain deleted or inactivated) 
form ot* an invention polypeptide in place of at least one variable region within an Ig molecule. In a 
particularly preferred embodiment, the immunoglobulin fusion includes the hinge. CH2 and CH3, or the hinge, 
CHI. CH2 and CH3 regions of an IgG I molecule. For the production of immunoglobulin fusions see also US 
Patent No. 5.428,130 issued June 27, 1995. 

D. Preparation of PRO 

The description below relates to primarily to production of PRO by cuiruring ceils transformed or 
transfected with a vector contaming PRO nucleic acid. It is. oi course, contemplated that alternative methods, 
which are well known in the an. may be employed to prepare PRO. For instance, the PRO sequence, or 
portions thereof, may be produced by direct peptide synthesis using solid-phase techniques fsee, e.g.. Stewart 
et a!., Solid-Phase Peptide Synthesis. W.H. Freeman Co., San Francisco. CA (1969): Merri field. J. Am. Chem. 
Soc. 85: 2149-2154 (1963)]. In vitro protein synthesis may be performed using manual techniques or by 
automation. Automated synthesis may be accomplished, for instance, using an Applied Biosystems Peptide 
Synthesizer (Foster City. CA) using the manufacturer's instructions. Various portions of the PRO may be 
chemically synthesized separately and combined using chemical or enzymatic methods to produce the full- 
length PRO. 

1. Isolation of DNA Fncodinc the PRO Polypeptide^) 

DNA encoding the PRO may be obtained from a cDNA library prepared from tissue believed to 
possess the polypeptide mRNA and to express it at a detectable level. Accordingly, human PRO DNA can be 
conveniently obtained from a cDNA library prepared from human tissue, such as described in the Examples. 
The PRO-encoding gene may also be obtained from a genomic library, oligonucleotide synthesis, or other 
known synthetic procedures {e.g., automated nucleic acid synthesis). 

Libraries can be screened with probes (such as antibodies to the PRO polypeptide or oligonucleotides 
of at least about 20-80 bases) designed to identify the gene of interest or the protein encoded by it. Screening 
the cDNA or genomic library with the selected probe may be conducted using standard procedures, such as 
described in Sambrook et aL Molecular Cloning: A Laboratory Manual (New York: Cold Spring Harbor 
Laboratory Press, 1989). An alternative means to isolate the gene encoding the PRO polypeptide is to use PCR 
methodology [Sambrook et aL supra; Dieffenbach et aL PCR Primer: A Laboratory Manual (Cold Spring 
Harbor Laboratory Press, 1995)]. 

The Examples below describe techniques for screening a cDNA library. The oligonucleotide 
sequences selected as probes should be of sufficient length and sufSciendy unambiguous that false positives 
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are minimized. The oligonucleotide is preferably labeled such that it can be detected upon hybridization to 
DNA in the library being screened. Methods of labeling are well known in the an, and include the use of 
radioiabeis like 32 P-labeled ATP, biotinyiation or enzyme labeling. Hybridization conditions, including 
moderate stringency and high stringency, are provided in Sambrook et aL supra. 

Sequences identified in such library screening methods can be compared and aligned to other known 
sequences deposited and available in public databases such as GenBank or other private sequence databases. 
Sequence identity (at either the amino acid or nucleotide level) within defined regions of the molecule or across 
the full-length sequence can be determined using methods known in the an and as described herein. 

Nucleic acid having protein coding sequence may be obtained by screening selected cDNA or 
genomic libraries using the deduced amino acid sequence disclosed herein for the first time, and, if necessary, 
using conventional primer extension procedures as described in Sambrook et ai. supra, to detect precursors 
and processing intermediates of mRNA that may not have been reverse-transcribed into cDNA. 
2. Selection and Transformation of Host Cells 
Host ceils arc transfected or transformed with expression or cloning vectors described herein for 
production of the PRO polypeptides and cultured in conventional nutrient media modified as appropriate for 
inducing promoters, selecting transformants. or amplifying the genes encoding the desired sequences. The 
culture conditions, such as media, temperature. pH and the like, can be selected by the skilled artisan without 
undue experimentation. In general, principles, protocols, and practical techniques for maximizing the 
productivity of ceil cultures can be found in Mammalian Cell Biotechnology: A Practical Approach. M 
Buiier, cd. (IRL Press, i 991 ) and Sambrook et ai. supra. 

Methods of transfection are known to the ordinarily skilled artisan, for example. CaPC>4 and 
eiectroporation. Depending on the host ceil used, transformation is performed using standard techniques 
appropriate to such cells. The calcium treatment employing calcium chloride, as described in Sambrook et aL. 
supra, or eiectroporation is generally used for prokaryotes or other ceils that contain substantial cell-wall 
barriers. Infection with Agrobacterium tumefaciens is used for trans formation of certain plant ceils, as 
described by Shaw et aL Gene. 22:315 M983) and WO 89/05859 published 29 June 1989, For mammalian 
cells without such cell walls, the calcium phosphate precipitation method of Graham and van der Eb. Virology, 
52:456-457 (1978) can be employed. General aspects of mammalian ceil host system transformations have 
been described in U.S. Patent No. 4.399.216. Transformations into yeast are typically carried out according to 
the method of Van Solingen et aL J. Bacu. H0:946 (1977) and Hsiao et ai. Proc. Natl. Acad. Sci (USA), 
76:3829 (1979). However, other methods for introducing DNA into cells, such as by nuclear microinjection, 
eiectroporation. bacterial protoplast fusion with intact cells, or polycations. e.g.. polybrene, polyornithine. may 
also be used. For various techniques for transforming mammalian cells, see Keown et aL Methods in 
Enzymology, ^85:527-537 (1990) and Mansour et aL Nature. 336:348-352 (1988). 

Suitable host cells for cloning or expressing the DNA in the vectors herein include prokaryote. yeast, 
or higher eukaryote ceils. Suitable prokaryotes include but are not limited to eubacteria, such as Gram- 
negative or Gram-positive organisms, for example, Enterobacteriaceae such as E. coli. Various E. coli strains 
are publicly available, such as E. coli K12 strain MM294 (ATCC 31,446); E. coli XI 776 (ATCC 31,537); E. 
colistnin W3110 (ATCC 27,325) and K5 772 (ATCC 53,635). Other suitable prokaryotic host ceils include 
Enterobacteriaceae such as Escherichia, e.g., E. coli, Enterobacter, Erwinia, Klebsiella. Proteus. Salmonella, 
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e.g.. Salmonella typhimurium. Serratia. e.g.. Serratia marcescans. and Shigella, as well as Bacilli such as B. 
subtiiis and B. licheniformis (e.g.. B. licheniformis 4IP disclosed in DD 266.710 published 12 April 1989), 
Pseudomonas such as P. aeruginosa, and Strepwmyces. These examples are illustrative rather than limiting. 
Strain W3110 is one particularly preferred host or parent host because it is a common host strain for 
recombinant DNA product fermentations. Preferably, the host cell secretes minimal amounts of proteolytic 
enzymes. For example, strain W3U0 may be modified to effect a genetic mutation in the genes encoding 
proteins endogenous to the host, with examples of such hosts including E. coli W31 10 strain IA2. which has 
the complete genotype tonA : E. coli W3 1 10 strain 9E4. which has the complete genotype tonA ptr3: E. coli 
W3I10 strain 27C7 (ATCC 55,244), which has the complete genotype wnA ptr3 phoA El 5 (argF-lac)l69 
dcgP ompT kan": E. coli W3110 strain 37D6. which has the complete genotype tonA ptrJ phoA El 5 (argf- 
lac)169 degP ompT rbs? ihG kan : E. coli W3U0 strain 40B4. which is strain 37D6 with a non-kanamycin 
resistant degP deletion mutation: and an E. coli strain having mutant periplasmic protease disclosed in U.S. 
Patent No. 4,946.783 issued 7 August 1990. Alternatively, in vitro methods of cloning, e.g.. PCR or other 
nucleic acid polymerase reactions, are suitable. 

In addition to prokaryotes. eukaryotic microbes such as filamemous fungi or yeast are suitable cloning 
or expression hosts for PRO-encoding vectors. Saccharomyces cerevisiae is a commonly used lower 
eukaryotic host microorganism. Others include Schizosaccharomyccs pombe (Beach and Nurse. Nature. 290: 
140 [1981]: EP 139.383 published 2 May 1985): Klay\>eromyces hosts (U.S. Patent No. 4.943.529: Fleer et aL 
Bio/Technoiog\\ 9:968-975 (1991)) such as. e.g.. K. lactis (MW98-8C, CBS683, CBS4574: Louvencourt et aL. 
J. Bacterial.. K54(2):73 7-742 [1983]), K. fiagsiis (ATCC 12,424). K. bulgaricus (ATCC 16.045), K. 
wickeramii (ATCC 24.178), K. waltii (ATCC 56,500). K. drosophilantm (ATCC 36.906: Van den Berg et aL 
Bio/Technology, 8:135 (1990)), K. thermotolerans. and K. marxianus; yarrowia (EP 402.226): Pichia pastoris 
(EP 183.070: Srceknshna et aL J. Basic Microbiol.. 23:265-278 [1988]); Candida; Trichoderma reesia (EP 
244.234): Neurospora crassa (Case et aL. Proc. Nad. Acad. Sci. USA, 76:5259-5263 [1979]); Schwann^mvcw 
such as Schwann/cvmre* occidental (EP 394.538 published 3 1 October 1990); and filamentous rungi such as, 
e.g.. Neurospora. Penicillium. Tolypocladium (WO 91/00357 published 10 January 1991). and Aspergillus 
hosts such as A. nidulans (Ballance ct aL. Biochem. Biophys. Res. Commun.. M2:284-289 [1983]; Tilburn et 
aL. Gene. 26:205-221 [1983 j; Yelton et aL. Proc. Natl. Acad. Sci. USA, 8±: 1470-1474 [1984]) and A. niger 
(Kelly and Hynes. EMBOJ.. 4:475-479 [1985]). Methylotropic yeasts are suitable herein and include, but are 
not limited to, yeast capable of growth on methanol selected from the genera consisting of Hansenula, 
Candida. Kloeckera. Pichia. Saccharomyces. Torulopsis. and Rhodotorula. A list of specific species that are 
exemplary of this class of yeasts may be found in C. Anthony, TJte Biochemistry of Methylotrophs, 269 (1982). 

Suitable host ceils for the expression of glycosylated PRO polypeptides are derived from multicellular 
organisms. Examples of invertebrate ceils include insect cells such as Drosophiia S2 and Spodoptera Sf9, as 
well as plant ceils. Examples of useful mammalian host ceil lines include Chinese hamster ovary (CHO) and 
COS cells. More specific examples include monkey kidney CV1 line transformed by SV40 (COS- 7, ATCC 
CRL 1651); human embryonic kidney line (293 or 293 cells subcloned for growth in suspension culture, 
Graham et aL J. Gen Virol. 36:59 (1977)); Chinese hamster ovary cellsADHFR (CHO, Urlaub and Chasin, 
Proc. Natl. Acad. ScL USA 77:4216 (1980)); mouse Sertoli cells (TM4, Mather, Biol. Reprod 23:243-251 
(1980)); human lung cells (WI38. ATCC CCL 75); human liver ceils (Hep G2, HB 8065); and mouse 
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mammary tumor (MMT 060562. ATCC CCL51). The selection of the appropriate host cell is deemed to be 
within the skill in the art 

3. Selection and Use of a Replicable Vector 

The nucleic acid (e.g.. cDNA or genomic DNA) encoding the PRO polypeptides may be inserted into 
a replicable vector for cloning (amplification of the DNA) or for expression. Various vectors are publicly 
available. The vector may. for example, be in the form of a piasmid, cosmid viral panicle, phagemid or phage. 
The appropriate nucleic acid sequence may be inserted into the vector by a variety of procedures. In general, 
DNA is inserted into an appropriate restriction endonuciease site(s) using techniques known in the art. Vector 
components generally include, but are not limited to. one or more of a signal sequence, an origin of replication, 
one or more marker genes, an enhancer element, a promoter, and a transcription termination sequence. 
Construction of suitable vectors containing one or more of these components employs standard ligation 
techniques which are known to the skilled artisan. 

The PRO may be produced recombinantiy not only directly, but also as a fusion polypeptide with a 
heterologous polypeptide, which may be a signal sequence or other polypeptide having a specific cleavage site 
at the N-termmus of the mature protein or polypeptide. In general, the signal sequence may be a component of 
the vector, or it may be a pan of the PRO-encoding DNA that is insened into the vector. The signal sequence 
may be a prokaryotic signal sequence selected, for example, from the group of the alkaline phosphatase, 
penicillinase. Ipp. or heat-stable entcrotoxin II leaders. For yeast secretion the signal sequence may be, e.g.. 
the yeast invenase leader, alpha factor leader (including Saccharomyces and Klttyveromyces a-factor leaders, 
the latter described in U.S. Patent No. 5.010,182), or acid phosphatase leader, the C albicans giucoamylase 
leader (EP 362.179 published 4 April 1990), or the signal described in WO 90/13646 published 15 November 
1990. in mammalian cell expression, mammalian signal sequences may be used to direct secretion of the 
protein, such as signal sequences from secreted polypeptides of the same or related species, as well as viral 
secretory leaders. 

Both expression and cloning vectors contain a nucleic acid sequence that enables the vector to 
replicate in one or more selected host cells. Such sequences are well known for a variety of bacteria, yeast, and 
viruses. The origin of replication from the piasmid pBR322 is suitable for most Gram-negative bacteria, the 2u 
piasmid origin is suitable for yeast, and various viral origins (SV40, polyoma, adenovirus. VSV or BPV) are 
useful for cloning vectors in mammalian cells. 

Expression and cloning vectors will typically contain a selection gene, also termed a selectable 
marker. Typical selection genes encode proteins that (a) confer resistance to antibiotics or other toxins, e.g.. 
ampiciilin. neomycin, methotrexate, or tetracycline, (b) complement auxotrophic deficiencies, or (c) supply 
cridcai nutrients not available from complex media, e.g.. the gene encoding D-alanine racemase for Bacilli. 

An example of suitable selectable markers for mammalian cells are those that enable the identification 
of ceils competent to take up the PRO-encoding nucleic acid, such as DHFR or thymidine kinase. An 
appropriate host cell when wild-type DHFR is employed is the CHO cell line deficient in DHFR activity, 
prepared and propagated as described by Urlaub et ai, Proc. Natl. Acad. ScL USA, 77:4216 (1980). A suitable 
selection gene for use in yeast is the trp\ gene present in the yeast piasmid YRp7 [Stinchcomb et ai. Nature, 
282:39 (1979); Kingsman et ai. Gene, 7:141 (1979); Tschemper et aL. Gene, H>:157 (1980)]. The trp\ gene 
provides a selection marker for a mutant strain of yeast lacking the ability to grow in tryptophan, for example, 
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ATCC No. 44076 orPEP4-I [Jones. Genetics. 85:12 (1977)]. 

Expression and cloning vectors usually contain a promoter operably linked to the PRO-encoding 
nucleic acid sequence to direct mRNA synthesis. Promoters recognized by a variety of potential host ceils are 
well known. Promoters suitable for use with prokaryotic hosts include the (3-lactamase and lactose promoter 
5 systems [Chang et aL Nature, 275:615 (1978): Goeddel et aL Nature, 281:544 (1979)], alkaline phosphatase, 
a tryptophan (trp) promoter system [Goeddei. Nucleic Acids Res.. 8:4057 (1980); EP 36,776], and hybrid 
promoters such as the tac promoter [deBoer et aL. Proc. NatL Acad. ScL USA. 80:21-25 (1983)]. Promoters for 
use in bacterial systems also will contain a Shine-Dalgamo (S.D.) sequence operably linked to the DNA 
encoding PRO. 

10 Examples of suitable promoting sequences for use with yeast hosts include the promoters for 3- 

phosphogiycerate kinase (Hitzeman et aL J. Biol. Chem.. 255:2073 (1980)] or other glycolytic enzymes (Hess 
et aL J. Adv. Enzyme Reg., 7:149 (1968): Holland. Biochemistry, L7:4900 (1978)], such as enoiase, 
giyceraidehyde-3-phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phospho-fructokinase, 
glucose-6-phosphate isomerase. 3-phosphoglycerate mutase. pyruvate kinase, triosephosphate isomerase. 

1 5 phosphoglucose isomerase. and elucokinase. 

Other yeast promoters, which are inducible promoters having the additional advantage of transcription 
controlled by growth conditions, are the promoter regions for alcohol dehydrogenase 2, isocytochrome C. acid 
phosphatase, degradative enzymes associated with nitrogen metabolism, metal lothionein. glyceraldehyde-3- 
phosphate dehydrogenase, and enzymes responsible for maltose and galactose utilization. Suitable vectors and 

20 promoters for use in yeast expression are farther described in EP 73,657. 

PRO transcription from vectors in mammalian host cells is controlled, for example, by promoters 
obtained from the genomes of viruses such as polyoma virus, fowipox virus (UK 2,21 1,504 published 5 July 
1989). adenovirus (such as Adenovirus 2), bovine papilloma virus, avian sarcoma virus, cytomegalovirus, a 
retrovirus, hepatitis-B virus and Simian Virus 40 (SV40). from heterologous mammalian promoters, e.g.. the 

25 actin promoter or an immunoglobulin promoter, and from heat-shock promoters, provided such promoters are 
compatible with the host cell systems. 

Transcription of a DNA encoding the PRO polypeptide by higher eukaryotes may be increased by 
inserting an enhancer sequence into the vector. Enhancers are cis-acting elements of DNA. usually about from 
10 to 300 bp, that act on a promoter to increase its transcription. Many enhancer sequences are now known 

30 from mammalian genes (globin, elastase, albumin, a- fetoprotein, and insulin). Typically, however, one will 
use an enhancer from a eukaryotic ceil virus. Examples include the SV40 enhancer on the late side of the 
replication origin (bp 100-270), the cytomegalovirus early promoter enhancer, the polyoma enhancer on the 
late side of the replication origin, and adenovirus enhancers. The enhancer may be spliced into the vector at a 
position 5' or 3' to the coding sequence of the PRO polypeptide, but is preferably located at a site 5' from the 

35 promoter. 

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant animal, human, or 
nucleated cells from other multicellular organisms) will also contain sequences necessary for the termination of 
transcription and for stabilizing the mRNA. Such sequences are commonly available from the 5 f and, 
occasionally 3', untranslated regions of eukaryotic or viral DNAs or cDNAs. These regions contain nucleotide 
40 segments transcribed as polyadeny lated fragments in the untranslated portion of the mRNA encoding PRO. 
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Still other methods, vectors, and host cells suitable for adaptation to the synthesis of the PRO 
polypeptide in recombinant vertebrate ceil culture are described in Gething et aL Nature, 293:620-625 (1981); 
Mantei a aL, Nature. 231:40-46(1979); EP 1 17,060; and EP ! 17,058. 

4. Detecting Gene Expression 

Gene expression may be measured in a sample directly, for example, by conventional Southern 
blotting. Northern blotting to quantitace the transcription of mRNA (Thomas, Proc. NatL Acad, Set. USA, 
77:5201-5205 (1980)], dot blotting (DNA analysis), or in situ hybridization, using an appropriately labeled 
probe, based on the sequences provided herein. Alternatively, antibodies may be employed that can recognize 
specific duplexes, including DNA duplexes. RNA duplexes, and DNA-RNA hybrid duplexes or DNA-protein 
duplexes. The antibodies in turn may be labeled and the assay may be carried out where the duplex is bound to 
a surface, so that upon the formation of duplex on the surface, the presence of antibody bound to the duplex 
can be detected. 

Gene expression, alternatively, may be measured by immunological methods, such as 
immunohistochemical staining of cells or tissue sections and assay of cell culture or body fluids, to quantitate 
directly the expression of gene product. Antibodies useful for immunohistochemical staining and'or assay of 
sample tluids may be either monoclonal or polyclonal, and may be prepared in any mammal. Conveniently, 
the antibodies may be prepared against a native sequence PRO polypeptide or against a synthetic peptide based 
on the DNA sequences provided herein or against exogenous sequence fused to DNA encoding the PRO 
polypeptide and encoding a specific antibody epitope. 

5. Purification of Polypeptide 

Forms of. the PRO may be recovered from culture medium or from host cell lysates. If membrane- 
bound, it can be released from the membrane using a suitable detergent solution (e.g.. Triton^X 100) or by 
enzymatic cleavage. Cells employed in expression of the PRO polypeptide can be disrupted by various 
physical or chemical means, such as freeze-thaw cycling, sonication. mechanical disruption, or cell lysing 
agents. 

It may be desired to purity PRO polypeptide from recombinant ceil proteins or polypeptides. The 
following procedures arc exemplary of suitable purification procedures: by fractionation on an ion-exchange 
column: ethanol precipitation: reverse phase HPLC; chromatography on silica or on a cation-exchange resin 
such as DEAE; chromatofocusing; SDS-PAGE:, ammonium sulfate precipitation; gel filtration using, for 
example. Sephadex G-75; protein A Sepharose columns to remove contaminants such as IgG; and metal 
chelating columns to bind epitope-tagged forms of the PRO polypeptide. Various methods of protein 
purification may be employed and such methods are known in the art and described for example in Deutscher, 
Methods in Enzymoiogy, J_82 (1990); Scopes, Protein Purification: Principles and Practice, Springer- Verlag, 
New York (1982). The purification step(s) selected will depend, for example, on the nature of the production 
process used and the particular PRO polypeptide produced. 

E. Tissue Distribution 

The location of tissues expressing the PRO can be identified by aetermining mRNA expression in 
various human tissues. The location of such genes provides information about which tissues are most likely to 
be affected by the stimulating and inhibiting activities of the PRO polypeptides. The location of a gene in a 
specific tissue also provides sample tissue for the activity blocking assays discussed below. 
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As. noted before, gene expression in various tissues may be measured by conventional Southern 
blotting, Northern blotting to quantitate the transcription of mRNA (Thomas. Proc. NatL Acad ScL USA, 
77:5201-5205 [1980]). dot blotting (DNA analysis), or in situ hybridization, using an appropriately labeled 
probe, based on the sequences provided herein. Alternatively, antibodies may be employed that can recognize 
5 specific duplexes, including DNA duplexes. RNA duplexes, and DNA-RNA hybrid duplexes or DNA-protein 
duplexes. 

Gene expression in various tissues, alternatively, may be measured by immunological methods, such 
as immunohistochemical staining of tissue sections and assay of cell culture or body fluids, to quantitate 
directly the expression of gene product; Antibodies useful for immunohistochemical staining and/or assay of 

10 sample fluids may be either monoclonal or polyclonal, and may be prepared in any mammal. Conveniently, 
the antibodies may be prepared against a native sequence of a PRO polypeptide or against a synthetic peptide 
based on the DNA sequences encoding the PRO polypeptide or against an exogenous sequence fused to a DNA 
encoding a PRO polypeptide and encoding a specific antibody epitope. General techniques for generating 
antibodies, and special protocols for Northern blotting and in situ hybridization are provided below. 

15 F. Antibody Bindinc Studies 

The activity of the PRO polypeptides can be further verified by antibody binding studies, in which the 
ability of anti-PRO200. anti-PRO204. anti-PR0212. anti-PR02l6. anti-PR0226, anti-PRO240. anti-PR0235. 
anti-PR0245. anti-PR0172, anti-PR0273. anti-PR0272, anti-PR0332, anti-PR0526, anii-PRO70l, anti- 
PR0361. anti-PR0362. anti-PR0363. anti-PR0364. ami-PR0356. anti-PR053 1 , anti-PR0533. anti-PROl083. 

20 anti-PROS65. anti-PRO770. anti-PR0769. anti-PR0788. anu-PRO!H4, anti-PRO1007. anti-PRO 1 1 84. anti- 
PROI031. anti-PROl346, anti-PROl 155, anti-PROl250. anti-PR013 12, ami-PROU92. anti-PRO 1 246. anti- 
PR01283, anti-PROM95, ahti-PR01343, ami-PR01418. anti-PROI387, anti-PROl410. anti-PROI917, anti- 
PR01868. anti-PRO205, anti-PR021. anti-PR0269, anti-PR0344, anti-PR0333. anti-PR0381. anti-PRO720. 
anti-PR0866. anti-PRO840. anti-PR0982. anti-PR0836, ana-PR01159, anti-PR01358. anti-PR01325. anti- 

25 PR01338. anti-PROl434. anti-PR04333. anti-PRO4302. anti-PRO4430 or anti-PR05727 antibodies to inhibit 
the effect of the PRO200. PRO204. PR0212. PR0216. PR0226. PRO240. PR0235. PR0245, PR0172. 
PR0273. PR0272. PR0332. PR0526. PRO70I. PR0361. PR0362. PR0363, PR0364. PR0356. PR0531. 
PR0533. PRO1083, PR0865, PRO770, PR076.9. PR0788, PROU14, PRO1007, PR01184, PRO1031, 
PR01346. PROU55, PROI250, PR01312, PR01I92, PR01246, PR01283, PR01195. PROI343, PR01418. 

30 PR01387, PROI410. PR01917, PR0I868. PRO205. PR021, PR0269. PR0344, PR0333, PR0381, PRO720, 
PR0866, PRO840, PR0982, PR0836, PROU59, PR01358, PR01325, PR01338, PR01434, PR04333, 
PRO4302, PRO4430 or PR05727 polypeptides, respectively, on tissue cells is tested. Exemplary antibodies 
include polyclonal, monoclonal, humanized, bispecific. and heteroconjugate antibodies, the preparation of 
which will be described hereinbelow. 

35 Antibody binding studies may be carried out in any known assay method, such as competitive binding 

assays, direct and indirect sandwich assays, and immunoprecipitation assays. Zola, Monoclonal Antibodies: A 
Manual of Techniques, pp.147-158 (CRC Press, Inc., 1987). 

Competitive binding assays rely on the ability of a labeled standard to compete with the test sample 
analyte for binding with a limited amount of antibody. The amount of target protein in the test sample is 

40 inversely proportional to the amount of standard that becomes bound to the antibodies. To facilitate 
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determining the amount of standard that becomes bound, the antibodies preferably are insolubilized before or 
after the competition, so that the standard and analyte that are bound to the antibodies may conveniently be 
separated from the standard and analyte which remain unbound. 

Sandwich assays involve the use of two antibodies, each capable of binding to a different 
immunogenic portion, or epitope, of the protein to be detected. In a sandwich assay, the test sample analyte is 
bound by a first antibody which is immobilized on a solid support and thereafter a second antibody binds to 
the anaiyte. thus forming an insoluble three-pan complex. See, e.g.. US Pat No. 4,376,110. The second 
antibody may itself be labeled with a detectable moiety (direct sandwich 

assays) or may be measured using an antt-immunoglobuiin antibody that is labeled with a detectable moiety 
(indirect sandwich assay). For example, one type of sandwich assay is an ELISA assay, in which case the 
detectable moiety is an enzyme. 

For immunohistochemistry. the tissue sample may be fresh or frozen or may be embedded in paraffin 
and fixed with a preservative such as formalin, for example. 

G. Cell- Based Assavs 

Cell-based assays and animal models for immune related diseases can be used to further understand 
the relationship between the genes and polypeptides identified herein and the development and pathogenesis of 
immune related disease. 

In a different approach, cells of a cell type known to be involved in a particular immune related 
disease are transfected with the cDNAs described herein, and the ability of these cDNAs to stimulate or inhibit 
immune function is analyzed. Suitable ceils can be transfected with the desired gene, and monitored for 
immune function activity. Such transfected cell lines can then be used to test the ability of poly- or monoclonal 
antibodies or antibody compositions to inhibit or stimulate immune function, for example to modulate T-cell 
proliferation or inflammatory cell infiltration. Cells transfected with the coding sequences of the genes 
identified herein can further be used to identify drug candidates for the treatment of immune related diseases. 

In addition, primary cultures derived from transgenic animals (as described below) can be used in the 
cell-based assays herein, although stable cell lines arc preferred. Techniques to derive continuous cell lines 
from transgenic animals are well known in the an (see, e.g.. Small et ai. MoL Cell. Biol. 5: 642-648 [1985]). 

One suitable ceil based assay is the mixed lymphocyte reaction (MLR). Current Protocols in 
Immunology, unit 3.12: edited by J E Coligan, A M Kruisbeek, D H Margiies, E M Shevach. W Strober, 
National Institutes of Health. Published by John Wiley & Sons, Inc. In this assay, the ability of a test 
compound to stimulate or inhibit the proliferation of activated T cells is assayed. A suspension of responder T 
cells is cultured with allogeneic stimulator cells and the proliferation of T cells is measured by uptake of 
tritiated thymidine. This assay is a general measure of T cell reactivity. Since the majority of T cells respond 
to and produce IL-2 upon activation, differences in responsiveness in this assay in part reflect differences in IL- 
2 production by the responding cells. The MLR results can be verified by a standard lymphokine (IL-2) 
detection assay. Current Protocols in Immunology \ above, 3. 15, 6.3. 

A proliferative T cell response in an MLR assay may be due to direct mitogenic properties of an 
assayed molecule or to external antigen induced activation. Additional verification of the T cell stimulatory 
activity of the PRO polypeptides can be obtained by a costimulation assay. T cell activation requires an 
antigen specific signal mediated through the T-cell receptor (TCR) and a costimulatory signal mediated 
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through a second ligand binding interaction, for example, the B7 (CD80. CD86)/CD2S binding interaction. 
CD28 crossiinking increases lymphokine secretion by activated T cells. T cell activation has both negative and 
positive controls through the binding of ligands which have a negative or positive effect. CD28 and CTLA-4 
are related glycoproteins in the Ig superfamily which bind to B7, CD28 binding to B7 has a positive 
costimuiation effect of T cell activation; conversely, CTLA-4 binding to B7 has a negative T ceil deactivating 
effect. Chambers. C. A. and Allison. J. P.. Curr. Opin. Immunol. (1997) 9:396. Schwartz. R. H., Cell (1992) 
71:1065; Linsey, P. S. and Ledberter. J. A., Annu. Rev. Immunol. (1993) Mkl91; June, C H. et at, Immunol. 
Today (\994) 15:321; Jenkins. M. JC. ? Immunity ( 1994) h 405. In a costimuiation assay, the PRO polypeptides 
are assayed for T cell costimulatory or inhibitory activity. 

PRO polypeptides, as well as other compounds of the invention, which are stimulators (costimulators) 
of T cell proliferation and agonists, e.g., agonist antibodies, thereto as determined by MLR and costimuiation 
assays, for example, are useful in treating immune related diseases characterized by poor, suboptimai or 
inadequate immune function. These diseases are treated by stimulating the proliferation and activation of T 
ceils (and T cell mediated immunity) and enhancing the immune response in a mammal through administration 
of a stimulatory compound, such as the stimulating PRO polypeptides. The stimulating polypeptide may. for 
example, be a PRO200. PRO204. PR0212, PR0216. PR0226. PRO240. PR0235. PR0245. PR0172, 
PR0273. PR0272. PR0332. PR0526. PRO701. PR036I. PR0362. PR0363, PR0364. PR0356. PR0531, 
PR0533. PRO1083, PR0865, PRO770. PR0769. PR0788. PROI114, PRO1007, PROH84, PROI03K 
PR01346. PR01155. PRO1250. PR013I2. PROI192, PR01246, PR01283, PR01I95, PR01343. PR01418, 
PR01387. PRO1410, PR01917. PR01868. PRO205, PR021, PR0269, PR0344, PR0333, PR0381. PRO720. 
PR0866. PRO840. PR0982, PR0836, PROII59, PROI358, PROI325, PROI338, PR01434. PR04333, 
PRO4302, PRO4430 or PR05727 polypeptide or an agonist antibody thereof. 

Direct use of a stimulating compound as in the invention has been validated in experiments with 4- 
IBB glycoprotein, a member of the tumor necrosis factor receptor family, which binds to a ligand (4-lBBL) 
expressed on primed T cells and signals T cell activation and growih. Alderson, M. E. et aL J. Immunol. 
(1994) 24:2219. 

The use of an agonist stimulating compound has also been validated experimentally. Activation of 4- 
1BB by treatment with an agonist anti-4-lBB antibody enhances eradication of tumors. Hellstrom. i. and 
Heilstrom, K. E., Crit. Rew Immunol. ( 1 998) U?: 1 . Immunoadjuvant therapy for treatment of tumors, described 
in more detail below, is another example of the use of the stimulating compounds of the invention. 

An immune stimulating or enhancing effect can also be achieved by antagonizing or blocking the 
activity of a PRO which has been found to be inhibiting in the MLR assay. Negating the inhibitory activity of 
the compound produces a net stimulatory effect Suitable antagonists/blocking compounds are antibodies or 
fragments thereof which recognize and bind to the inhibitory protein, thereby blocking the effective interaction 
of the protein with its receptor and inhibiting signaling through the receptor. This effect has been validated in 
experiments using anti-CTLA-4 antibodies which enhance T cell proliferation., presumably by removal of the 
inhibitory signal caused by CTLA-4 binding. Walunas, T. L. et at, Immunity (1994) L405. 

Alternatively, an immune stimulating or enhancing effect can also be achieved by administration of a 
PRO which has vascular permeability enhancing properties. Enhanced vacuolar permeability would be 
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beneficial to disorders which can be attenuated by local infiltration of immune cells (e.g., monocytes, 
eosinophils, PMNs) and inflammation. 

On the other hand. PRO polypeptides, as well as other compounds of the invention, which are direct 
inhibitors of T cell proliferation/ activation, lymphokine secretion, and/or vascular permeability can be directly 
5 used to suppress the immune response. These compounds are useful to reduce the degree of the immune 
response and to treat immune related diseases characterized by a hyperactive, superoptimaL or autoimmune 
response. This use of the compounds of the invention has been validated by the experiments described above 
in which CTLA-4 binding to receptor B7 deactivates T cells. The direct inhibitory compounds of the invention 
function in an analogous manner. The use of compound which suppress vascular permeability would be 
10 expected to reduce inflammation. Such uses would be beneficial in treating conditions associated with 
excessive inflammation. 

Alternatively, compounds, e.g., antibodies, which bind to stimulating PRO polypeptides and block the 
stimulating effect of these molecules produce a net inhibitory effect and can be used to suppress the T cell 
mediated immune response by inhibiting T ceil proliferation/ activation and/or lymphokine secretion. Blocking 
1 5 the stimulating effect of the polypeptides suppresses the immune response of the mammal. This use has been 
validated in experiments using an anti-lL2 antibody. In these experiments, the antibody binds to 1L2 and 
blocks binding of IL2 to its receptor thereby achieving a T cell inhibitory effect. 
H. Animal Models 

The results of the cell based in vitro assays can be further verified using in vivo animal models and 

20 assays for T-cel! function. A variety of weii known animal models can be used to further understand the role 
of the genes identified herein in the development and pathogenesis of immune related disease, and to test the 
efficacy of candidate therapeutic agents, including antibodies, and other antagonists of the native polypeptides, 
including small molecule antagonists. The in vivo nature of such models makes them predictive of responses 
in human patients. Animal models of immune related diseases include both non- recombinant and recombinant 

25 (transgenic) animals. Non-recombinant animal models include, for example, rodent, e.g.. murine models. 
Such models can be generated by introducing cells into syngeneic mice using standard techniques, e.g.. 
subcutaneous injection, tail vein injection, spleen implantation, intraperitoneal implantation, implantation 
under the renal capsule, etc. 

Graft- versus-host disease occurs when immunocompetent cells are transplanted into 

30 immunosuppressed or tolerant patients. The donor cells recognize and respond to host antigens. The response 
can vary from life threatening severe inflammation to mild cases of diarrhea and weight loss. Graft-versus-host 
disease models provide a means of assessing T cell reactivity against MHC antigens and minor transplant 
antigens. A suitable procedure is described in detail in Current Protocols in Immunology, above, unit 4.3. 

An animal model for skin allograft rejection is a means of testing the ability of T cells to mediate in 

35 vivo tissue destruction and a measure of their role in transplant rejection. The most common and accepted 
models use murine tail-skin grafts. Repeated experiments have shown that skin allograft rejection is mediated 
by T cells, helper T cells and killer-effector T cells, and not antibodies. Auchincloss, H. Jr. and Sachs, D. H., 
Fundamental Immunology, 2nd ed„ W. E. Paul ed.. Raven Press, NY, 1989, 889-992. A suitable procedure is 
described in detail in Current Protocols in Immunology, above, unit 4.4. Other transplant rejection models 
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which can be used to test the compounds of the invention are the allogeneic heart transplant models described 
by Tanabe, M. et ai Transplantation (1994) 58:23 and Tinubu. S. A. et ai J. Immunol. (1994) 4330-4338. 

Animal models for delayed type hypersensitivity provides an assay of cell mediated immune function 
as well. Delayed type hypersensitivity reactions are a T cell mediated in vivo immune response characterized 
by inflammation which does not reach a peak until after a period of time has elapsed after challenge with an 
antigen. These reactions also occur in tissue specific autoimmune diseases such as multiple sclerosis (MS) and 
experimental autoimmune encephalomyelitis (EAE. a . model for MS). A suitable procedure is described in 
detail in Current Protocols in Immunology, above, unit 4.5. 

EAE is a T ceil mediated autoimmune disease characterized by T cell and mononuclear cell 
inflammation and subsequent demyelination of axons in the central nervous system. EAE is generally 
considered to be a relevant animal model for MS in humans. Bolton. G. Multiple Sclerosis (1995) 1:143. 
Both acute and relapsing-remirting models have been developed. The compounds of the invention can be 
tested for T cell stimulatory or inhibitory activity against immune mediated demyelinating disease using the 
protocol described in Current Protocols in Immunology, above, units 15.1 and 15.2. See also the models for 
myelin disease in which oligodendrocytes or Schwann cells arc grafted into the central nervous system as 
described in Duncan. 1. D. et ai Molec. Med. Today ( 1997) 554-561. 

Contact hypersensitivity is a simple delayed type hypersensitivity in vivo assay of cell mediated 
immune function. In this procedure, cutaneous exposure to exogenous haptens which gives rise to a delayed 
type hypersensitivity reaction which is measured and quamitated. Contact sensitivity involves an initial 
sensitizing phase followed by an eltcitation phase. The clicitation phase occurs when the T lymphocytes 
encounter an antigen to which they have had previous contact. Swelling and inflammation occur, making this 
an excellent mode! of human allergic contact dermatitis. A suitable procedure is described in detail in Current 
Protocols in Immunology, Eds. J. E. Cologan, A. M. Kruisbeek, D. H. Margulies. E. M. Shevach and W. 
Strober. John Wiley & Sons. Inc.. 1994. unit 4.2. See also Grabbe. S. and SchwarzvT. Immun. Today 19 (1): 
37-44(1998). 

An animal model for arthritis is collagen-induced arthritis. This model shares clinicaL histological 
and immunological characteristics of human autoimmune rheumatoid arthritis and is an acceptable model for 
human autoimmune arthritis. Mouse and rat models are characterized by synovitis, erosion of cartilage and 
subchondral bone. The compounds of the invention can be tested for activity against autoimmune arthritis 
using the protocols described in Current Protocols in Immunology, above, units 15.5. See aiso the model using 
a monoclonal antibody to CD 1 8 and VLA-4 integrins described in Issekutz, A.C. et aL Immunology (1996) 
88:569. 

A model of asthma has been described in which amigen-induced airway hyper-reactivity, pulmonary 
eosinophilia and inflammation are induced by sensitizing an animal with ovalbumin and then challenging the 
animal with the same protein delivered by aerosol. Several animal models (guinea pig, rat, non-human 
primate) show symptoms similar to atopic asthma in humans upon challenge with aerosol antigens. Murine 
models have many of the features of human asthma. Suitable procedures to test the compounds of the 
invention for activity and effectiveness in the treatment of asthma are described by Wolyniec. W. W. et aU Am. 
71 Respir. Cell Mol Biol (1998) Hi: 777 and the references cited therein. 
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Additionally, the compounds of the invention can be tested on animal models for psoriasis like 
diseases. Evidence suggests a T cell pathogenesis for psoriasis. The compounds of the invention can be tested 
in the scid/scid mouse model described by Schon. M P. et aL Nat. Med. (1997) 3:183, in which the mice 
demonstrate histopathologic skin lesions resembling psoriasis. Another suitable model is the human skin/scid 
mouse chimera prepared as described by Nickoioff. B. J. et at. Am. J. Path. (1995) 146:580. 

Recombinant (transgenic) animal models can be engineered by introducing the coding portion of the 
genes identified herein into the genome of animals of interest, using standard techniques for producing 
transgenic animals. .Animals that can serve as a target for transgenic manipulation include, without limitation, 
mice. rats, rabbits, guinea pigs, sheep, goats, pigs, and non-human primates, e.g., baboons, chimpanzees and 
monkeys. Techniques known in the an to introduce a transgene into such animals include pronucleic 
microinjection fHoppe and Wanger, U.S. Patent No. 4,873,191); retrovirus-mediated gene transfer into germ 
lines (e.g.. Van der Putten et aL Proc. Natl. Acad. Set. USA 82. 6148-615 (1985]); gene targeting in embryonic 
stem cells (Thompson et aL, Ceil 56. 313-321 [1989]); eiectroporation of embryos (Lo. Mot. CeL Biol. 3, 
1803-1814 [1983]); sperm-mediated gene transfer (Lavitrano et aL Cell 57. 717-73 [1989]). For review, see. 
for example. U.S. Patent No. 4.736.866. 

For the purpose of the present invention, transgenic animals include those that carry the transgene 
only in pan of their cells ("mosaic animals"). The transgene can be integrated either as a single transgene. or in 
concatamers. e.g., hcad-to-head or hcad-to-tail tandems. Selective introduction of a transgene into a particular 
ceil type is also possible by following, for example, the technique of Lasko et aL Proc. Natl. Acad. Sci. USA 
89,6232-636(1992). 

The expression of the transgene in transgenic animals can be monitored by standard techniques. For 
example. Southern blot analysis or PCR amplification can be used to verify the integration of the transgene. 
The level of mRNA expression can then be analyzed using techniques such as in situ hybridization. Northern 
blot analysis. PCR, or immunocytochemistry. 

The animals may be further examined for signs of immune disease pathology, for example by 
histological examination to determine infiltration of immune cells into specific tissues. Blocking experiments 
can also be performed in which the transgenic animals are treated with the compounds of the invention to 
determine the extent of the T cell proliferation stimulation or inhibition of the compounds. In these 
experiments, blocking antibodies which bind to the PRO polypeptide, prepared as described above, are 
administered to the animal and the effect on immune function is determined. 

Alternatively, "knock out" animals can be constructed which have a defective or altered gene 
encoding a polypeptide identified herein, as a result of homologous recombination between the endogenous 
gene encoding the polypeptide and altered genomic DNA encoding the same polypeptide introduced into an 
embryonic cell of the animaL For example, cDNA encoding a particular polypeptide can be used to clone 
genomic DNA encoding that polypeptide in accordance with established techniques. A portion of the genomic 
DNA encoding a particular polypeptide can be deleted or replaced with another gene, such as a gene encoding 
a selectable marker which can be used to monitor integration. Typically, several kilobases of unaltered 
flanking DNA (both at the 5* and 3* ends) are included in the vector [see e.g., Thomas and Capecchi, Cell, 
51:503 (1987) for a description of homologous recombination vectors]. The vector is introduced into an 
embryonic stem cell line {e.g. $ by eiectroporation) and cells in which the introduced DNA has homoiogously 
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recombined with the endogenous DNA are selected [see e.g.. Li et aL. Cell 69:915 (1992)]. The selected cells 
are then injected into a blastocyst of an animal (e.g., a mouse or rat) to form aggregation chimeras (see e.g.. 
Bradley, in Teratocarcinomas and Embryonic Stem Cells: A Practical Approach. E. J. Robertson, ed. (IRL. 
Oxford. 1987), pp. 1 13-152]. A chimeric embryo can then be implanted into a suitable pseudopregnam female 
foster animal and the embryo brought to term to create a "knock out" animal. Progeny harboring the 
homologously recombined DNA in their germ cells can be identified by standard techniques and used to breed 
animals in which all cells of the animal contain the homologously recombined DNA. Knockout animals can be 
characterized for instance, for their ability to defend against certain pathological conditions and for their 
development of pathological conditions due to absence of the polypeptide. 
I. - Immuno Adjuvant Therapy 

In one embodiment, the immunostimulating compounds of the invention can be used in 
immunoadjuvant therapy for the treatment of tumors (cancer). It is now well established that T cells recognize 
human tumor specific antigens. One group of tumor antigens, encoded by the MAGE, BAGE and GAGE 
families of genes, are silent in all adult normal tissues . but are expressed in significant amounts in tumors, 
such as melanomas, iune rumors, head and neck tumors, and bladder carcinomas. DeSmet. C. et aL, (1996) 
Proc. Nad. Acad. Sci. USA. 93:7149. It has been shown that costimulation of T ceils induces rumor regression 
and an antitumor response both in vitro and in vivo. Meiero, I. at aL. Nature Medicine (1997) 3:682: Kwon. E. 
D. et aL. Proc. NatL Acad ScL USA (1997) 94: 8099: Lynch, D. H. et al, Nature Medicine (1997) 3:625: Finn. 
O. J. and Lotze. M. T.. J. Immunol. (1998) 2U 14. The stimulatory compounds of the invention can be 
administered as adiuvants, alnne or rnoprh^r m; r k ,.-„,„.u 

chemotherapeutic agent, to stimulate T ceil proliferation/activation and an antitumor response to tumor 
antigens. The growth regulating, cytotoxic, or chemotherapeutic agent may be administered in conventional 
amounts using known administration regimes. Immunostimulating activity by the compounds of the invention 
allows reduced amounts of the growth regulating, cytotoxic, or chemotherapeutic agents thereby potentially 
lowering the toxicity to the patient. 

J. Screening As.savs for Drutz Candidates 

Screening assays for drug candidates are designed to identity compounds that bind to or complex with 
the polypeptides encoded by the genes identified herein or a biologically active fragment thereof, or otherwise 
interfere with the interaction of the encoded polypeptides with other cellular proteins. Such screening assays 
will include assays amenable to high- throughput screening of chemical libraries, making them particularly 
suitable for identifying small molecule drug candidates. Small molecules contemplated include synthetic 
organic or inorganic compounds, including peptides, preferably soluble peptides, (poly)peptide- 
immunogiobulin fusions, and. in particular, antibodies including, without limitation, poly- and monoclonal 
antibodies and antibody fragments, single-chain antibodies, ami- idiotypic antibodies, and chimeric or 
humanized versions of such antibodies or fragments, as well as human antibodies and antibody fragments. The 
assays can be performed in a variety of formats, including protein-protein binding assays, biochemical 
screening assays, immunoassays and cell based assays, which are well characterized in the an. 

All assays are common in that they call for contacting the drug candidate with a polypeptide encoded 
by a nucleic acid identified herein under conditions and for a time sufficient to allow these two components to 



interact 
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In binding assays, the interaction is binding and the complex formed can be isolated or detected in the 
reaction mixture. In a particular embodiment, the polypeptide encoded by the gene identified herein or the 
drug candidate is immobilized on a solid phase, e.g., on a microliter plate, by covalent or non-covaient 
attachments. Non-covaient attachment generally is accomplished by coating the solid surface with a solution 
of the polypeptide and drying. Alternatively, an immobilized antibody, e.g., a monoclonal antibody, specific 
for the polypeptide to be immobilized can be used to anchor it to a solid surface. The assay is performed by 
adding the non-immobilized component, which may be labeled by a detectable label, to the immobilized 
component, e.g., the coated surface containing the anchored component. When the reaction is complete, the 
non-reacted components are removed, e.g., by washing, and complexes anchored on the solid surface are 
detected. When the originally non-immobilized component carries a detectable label, the detection of label 
immobilized on the surface indicates that complexing occurred. Where the originally non-immobilized 
component does not carry a label, complexing can be detected, for example, by using a labelled antibody 
specifically binding the immobilized complex. 

If the candidate compound interacts with but does not bind to a particular protein encoded by a gene 
identified herein, us interaction with that protein can be assayed by methods well known for detecting protein- 
protein interactions. Such assays include traditional approaches, such as. cross-iinking, co- 
immunoprecipuauon. and co-purification through gradients or chromatographic columns. In addition, protein- 
protein interactions can be monitored by using a yeast-based genetic system described by Fields and co- 
workers (Fields and Song. Nature (Londonl 340. 245-246 (1989); Chien et ai. Proc. Natl. Acad. Sci. USA 88, 
9578-95S2 (1991)] as disclosed by Chevray and Nathans. Proc. Natl. Acad. ScL USA 89. 5789-5793 (1991). 
Many transcriptional activators, such as yeast GAL4, consist of two physically discrete modular domains, one 
acting as the DNA-binding domain, while the other one functioning as the transcription activation domain. 
The yeast expression system described in the foregoing publications (generally referred to as the "two-hybrid 
system") takes advantage of this property, and employs two hybrid proteins, one in which the target protein is 
fused to the DNA-binding domain of GAL4. and another, in which candidate activating proteins are fused to 
the activation domain. The expression of a QALUlacZ reporter gene under control of a GAL4-activated 
promoter depends on reconstirution of GAL4 activity via protein-protein interaction. Colonies containing 
interacting polypeptides are detected with a chromogenic substrate for 0-galactosidase. A complete kit 
(MATCHMAKER rM ) for identifying protein-protein interactions between two specific proteins using the two- 
hybrid technique is commercially available from Clontech. This system can also be extended to map protein 
domains involved in specific protein interactions as well as to pinpoint amino acid residues that are crucial for 
these interactions. 

In order to find compounds that interfere with the interaction of a gene identified herein and other 
intra- or extracellular components can be tested, a reaction mixture is usually prepared containing the product 
of the gene and the intra- or extracellular component under conditions and for a time allowing for the 
interaction and binding of the two products. To test the ability of a test compound to inhibit binding, the 
reaction is run in the absence and in the presence of the test compound. In addition, a placebo may be added to 
a third reaction mixture, to serve as positive control. The binding (complex formation) between the test 
compound and the intra- or extracellular component present in the mixture is monitored as described above. 
The formation of a complex in the control reaction(s) but not in the reaction mixture containing the test 
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compound indicates that the test compound interferes with the interaction of the test compound and its reaction 
partner. 

K. Compositions and Methods for the Treatment of Immune Related Diseases 

The compositions useful in the treatment of immune related diseases include, without limitation, 
proteins, antibodies, small organic molecules, peptides, phosphopeptides, antisense and ribozyme molecules, 
triple helix molecules, etc. that inhibit or stimulate immune tunction. for example. T cell 
proiiferauon/activation, iymphokine release, or immune cell infiltration. 

For example, antisense RNA and RNA molecules act to directly block the translation of mRNA by 
hybridizing to targeted mRNA and preventing protein translation. When antisense DNA is used, 
oligodeoxyribonucleotides derived from the translation initiation site, e.g.. between about -10 and +10 
positions of the target gene nucleotide sequence, are preferred. 

Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of RNA. 
Ribozymes act by sequence-specific hybridization to the complementary target RNA, followed by 
endonucleolytic cleavage. Specific ribozyme cleavage sites within a potential RNA target can be identified by 
known techniques. For further details see. e.g.. Rossi. Current Bioiogy 4. 469-471 (1994). and PCT 
publication No. WO 97/33551 (published September 18. 1997). 

Nucleic acid molecules in triple helix formation used to inhibit transcription should be single-stranded 
and composed of deoxynucieotides. The base composition of these oligonucleotides is designed such that it 
promotes triple helix formation via Hoogsteen base pairing rules, which generally require sizeable stretches of 
purines or pyrimidines on one strand of a duplex. For fanner details sec, e.g., PCT publication No. WO 
97/33551, supra. 

These molecules can be identified by any or any combination of the screening assays discussed above 
and/or by any other screening techniques well known for those skilled in the art. 
L Antibodies 

The present invention further provides anti-PRO antibodies and fragments thereof which may inhibit 
(antagonists) or stimulate (agonists) T cell proliferation, eosinophil infiltration, vascular permeability, etc. 
Such anti-PRO antibodies or fragments thereof include polyclonal, monoclonal, humanized, bispecific and 
heteroconjugate antibodies. 

1. Polyclonal Antibodies 

The anti-PRO antibodies may comprise polyclonal antibodies. Methods of preparing polyclonal 
antibodies are known to the skilled artisan. Polyclonal antibodies can be raised in a mammal, for example, by 
one or more injections of an immunizing agent and, if desired, an adjuvant Typically, the immunizing agent 
and/or adjuvant will be injected in the mammal by multiple subcutaneous or intraperitoneal injections. The 
immunizing agent may include the PRO polypeptide or a fusion protein thereof. It may be useful to conjugate 
the immunizing agent to a protein known to be immunogenic in the mammal being immunized. Examples of 
such immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum albumin, bovine 
thyrogiobulin. and soybean trypsin inhibitor. Examples of adjuvants which may be employed include Freund's 
complete adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). 
The immunization protocol may be selected by one skilled in the art without undue experimentation. 
2. Monoclonal Antibodies 
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The anti-PRO antibodies may, alternatively, be monoclonal antibodies. Monoclonal antibodies may 
be prepared using hybridoma methods, such as those described by fCohler and Milstein. Nature, 256:495 
(1975). In a hybridoma method, a mouse, hamster, or other appropriate host animal, is typically immunized 
with an immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies that will 
specifically bind to the immunizing agent Alternatively, the lymphocytes may be immunized in vitro. 

The immunizing agent will typically include the PRO polypeptide or a fusion protein thereof. 
Generally, either peripheral blood lymphocytes CPBLs") are used if cells of human origin are desired, or 
spleen cells or lymph node ceils are used if non-human mammalian sources are desired. The lymphocytes are 
then fused with an immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a 
hybridoma ceil [Goding. Monoclonal Antibodies: Principles and Practice, Academic Press. (1986) pp. 59- 
103]. Immortalized cell lines are usually transformed mammalian ceils, particularly myeloma cells of rodent, 
bovine and human origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells may 
be cultured in a suitable culture medium that preferably contains one or more substances that inhibit the growth 
or survival of the untiised. immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine 
guanine phosphonbosyl transferase (HGPRT or HPRT). ihe culture medium for the hybridomas typically will 
include hypoxanthine. ammoptenn. and thymidine ("HAT medium"), which substances prevent the growth of 
HGPRT-deficient cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable high level expression of 
antibody by the selected antibody-producing cells, and are sensitive to a medium such as HAT medium. More 
preferred immortalized cell lines arc murine myeloma iines, which can be obtained, for instance, from the Salk 
Institute Cell Distribution Center. San Diego, California and the American Type Culture Collection. Manassas, 
Virginia. Human myeloma and mouse-human heteromyeloma cell lines also have been described for the 
production of human monoclonal antibodies [Kozbor. /. Immunol., 133:3001 (1984); Brodeur et aL 
Monoclonal Antibody Production Techniques and Applications, Marcel Dekker, Inc.. New York. (1987) pp. 
51-63]. 

The culture medium in which the hybridoma ceils are cultured can then be assayed for the presence of 
monoclonal antibodies directed against PRO. Preferably, the binding specificity of monoclonal antibodies 
produced by the hybridoma cells is determined by immunoprecipitation or by an in vitro binding assay, such as 
radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are 
known in the an. The binding affinity of the monoclonal antibody can, for example, be determined by the 
Scatchard analysis of Munson and Pollard, Anal. Biochem., 107:220 (1980). 

After the desired hybridoma cells are identified, the clones may be subcloned by limiting dilution 
procedures and grown by standard methods [Goding, supra]. Suitable culture media for this purpose include, 
for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium. Alternatively, the hybridoma 
cells may be grown in vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones may be isolated or purified from the culture 
medium or ascites fluid by conventional immunoglobulin purification procedures such as, for example, protein 
A-Sepharose, hydroxyapatite chromatography, gel electrophoresis, dialysis, or affinity chromatography. 

The monoclonal antibodies may also be made by recombinant DNA methods, such as those described 
in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the invention can be readily 
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isolated and sequenced using conventional procedures (e.g., by using oligonucleotide probes that are capable of 
binding specifically to genes encoding the heavy and light chains of murine antibodies). The hybridoma ceils 
of the invention serve as a preferred source of such DNA. Once isolated, the DNA may be placed into 
expression vectors, which are then transfected into host cells such as simian COS cells, Chinese hamster ovary 
(CHO) cells, or myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis 
of monoclonal antibodies in the recombinant host cells. The DNA also may be modified, for example, by 
substituting the coding sequence for human heavy and light chain constant domains in place of the homologous 
murine sequences [U.S. Patent No. 4,816,567; Morrison et aL supra] or by covalendy joining to the 
immunoglobulin coding sequence all or pan of the coding sequence for a non- immunoglobulin polypeptide. 
Such a non-imrriunoglobulin polypeptide can be substituted for the constant domains of an antibody of the 
invention, or can be substituted for the variable domains of one antigen-combining site of an antibody of the 
invention to create a chimeric bivalent antibody. ' 

The antibodies are preferably monovalent antibodies. Methods for preparing monovalent antibodies 
are well known in the art. For example, one method involves recombinant expression of immunoglobulin light 
chain and modified heavy chain. The heavy chain is truncated generally at any point in the Fc region so as to 
prevent heavy chain crossiinking. Alternatively, the relevant cysteine residues are substituted with another 
amino acid residue or are deleted so as to prevent crosslinking. 

In vitro methods are also suitable for preparing monovalent antibodies. Digestion of antibodies to 
produce fragments thereof, particularly. Fab fragments, can be accomplished using routine techniques known 
in the art. 

3. Human and Humanized Antibodies 

The anti-PRO antibodies of the invention may further comprise humanized antibodies or human 
antibodies. Humanized forms of non-human {eg,, murine) antibodies are chimeric immunoglobulins, 
immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab')2 or other antigen-binding 
subsequences of antibodies) which contain minimal sequence derived from non-human immunoglobulin. 
Humanized antibodies include human immunoglobulins (recipient antibody) in which residues from a 
complementary determining region (CDR) of the recipient are replaced by residues from a CDR of a non- 
human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and 
capacity. In some instances, Fv framework residues of the human immunoglobulin are replaced by 
corresponding non-human residues. Humanized antibodies may also comprise residues which are found 
neither in the recipient antibody nor in the imported CDR or framework sequences. In general, the humanized 
antibody will comprise substantially all of at least one, and typically two, variable domains, in which all or 
substantially al! of the CDR regions correspond to those of a non-human immunoglobulin and all or 
substantially all of the FR regions are those of a human immunoglobulin consensus sequence. The humanized 
antibody optimally also will comprise at least a portion of an immunoglobulin constant region (Fc), typically 
that of a human immunoglobulin [Jones et aL Nature. 321:522-525 (1986); Riechmann et aL Nature, 
332:323-329 (1988); and Presta, Curr. Op. Struct BioL 2:593-596 (1992)]. 

Methods for humanizing non-human antibodies are well known in the art Generally, a humanized 
antibody has one or more amino acid residues introduced into it from a source which is non-human. These 
non-human amino acid residues are often referred to as "import" residues, which are typically taken from an 
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"import" variable domain. Humanization can be essentially performed following the method of Winter and 
coworkers [Jones et aL. Nature. 32L-522-525 (1986); Riechmann et aL. Nature. 332:323-327 (1988); 
Verhoeyen et aL Science. 239:1534-1536 (1988)], by substituting rodent CDRs or CDR sequences for the 
corresponding sequences of a human antibody. Accordingly, such "humanized" antibodies are chimeric 
5 antibodies (U.S. Patent No. 4.816,567), wherein substantially less than an intact human variable domain has 
been substituted by the corresponding sequence from a non-human species. In practice, humanized antibodies 
are typically human antibodies in which some CDR residues and possibly some FR residues are substituted by 
residues from analogous sites in rodent antibodies. 

Human antibodies can also be produced using various techniques known in the art, including phage 

10 display libraries' ( Ho ogenboom and Winter. J. MoL Biol.. 227:381 (1991); Marks et aL J. MoL BioL 222:581 
(1991)]. The techniques of Cole at aL and Boerner et aL are also available for the preparation of human 
monoclonal antibodies (Cole et aL. Monoclonal Antibodies and Cancer Tlierapy, Alan R. Liss, p. 77 (1985); 
Boerner et aL. J. Immunol.. ]47(l):86-95 (1991); U.S. 5,750, 373). Similarly, human antibodies can be made 
by introducing of human immunoglobulin loci into transgenic animals, e.g.. mice in which the endogenous 

15 immunoglobulin genes have been partially or completely inactivated. Upon challenge, human antibody 
production is observed, which closely resembles that seen in humans in all respects, including gene 
rearrangement, assembly, and antibody repertoire. This approach is described, for example, in U.S. Patent 
Nos. 5.545,807: 5.545.806: 5.569.825; 5,625,126; 5,633.425: 5.661.016. and in the following scientific 
publications: Marks et aL Bio/Technology \Q. 779-783 (1992); Lonberg et aL Nature 368: 856-859 (1994): 

20 Morrison. Nature 368. 812-13 (1994); Fishwiid et aL Nature Biotechnology J4, 845-51 (1996); Neuberger, 
Nature Biotechnology ]4. 826 (1996): Lonberg and Huszar, Intern. Rev. Immunol. ]3 65-93 (1995). 

The antibodies may also be affinity matured using known selection and/or mutagenesis methods as 
described above. Preferred affinity matured antibodies have an affinity which is five times, more preferably 10 
times, even more preferably 20 or 30 times greater than the starting antibody (generally murine, humanized or 

25 human) from which the matured antibody is prepared. 

4. Bispecific Antibodies 
Bispccific antibodies arc monoclonal, preferably human or humanized, antibodies that have binding 
specificities for at least two different antigens. In the present case, one of the binding specificities may be for 
the PRO. the other one is for any other antigen, and preferably for a cell-surface protein or receptor or receptor 

30 subunit. 

Methods for making bispecific antibodies are known in the an. Traditionally, the recombinant 
production of bispecific antibodies is based on the coexpression of two immunoglobulin heavy-chain/light- 
chain pairs, where the two heavy chains have different specificities (Milstein and Cuello, Nature. 305:537-539 
[1983]). Because of the random assortment of immunoglobulin heavy and light chains, these hybridomas 

35 (quadromas) produce a potential mixture of ten different antibody molecules, of which only one has the correct 
bispecific structure. The purification of the correct molecule is usually accomplished by affinity 
chromatography steps. Similar procedures are disclosed in WO 93/08829, published 13 May 1993. and in 
Trauneckerer*/.. EMBOJ., 10:3655-3659 (1991). 

Antibody variable domains with the desired binding specificities (antibody-antigen combining sites) 

40 can be fused to immunoglobulin constant domain sequences. The fusion preferably is with an immunoglobulin 
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heavy-chain constant domain, comprising at least pan of the hinge. CH2. and CH3 regions. It is preferred to 
have the first heavy-chain constant region (CHI) containing the site necessary for light-chain binding present 
in at least one of the fusions. DNAs encoding the immunoglobulin heavy-chain fusions and. if desired, the 
immunoglobulin light chain, are inserted into separate expression vectors, and are cotransfected into a suitable 
5 host organism. For further details of generating bispecific antibodies see, for example, Suresh et ai. w Methods 
in Enzymology, \2V.1\ 0(1986). 

According to another approach described in WO 96/2701 1, the interface between a pair of amibody 
molecules can be engineered to maximize the percentage of heterodimers which are recovered from 
recombinant cell culture. The preferred interface comprises at least a pan of the CH3 region of an antibody 

10 constant domain. In this method, one or more small amino acid side chains from the interface of the first 
antibody molecule are replaced with larger side chains (e.g., tyrosine or tryptophan). Compensatory "cavities" 
of identical or similar size to the large side chain(s) are created on the interface of the second antibody 
molecule by replacing large amino acid side chains with smaller ones (e.g., alanine or threonine). This 
provides a mechanism for increasing the yield of the heterodimer over other unwanted end-products such as 

15 homodimers. 

Bispecific antibodies can be prepared as full length antibodies or antibody fragments {e.g., F(ab*) 2 
bispecific antibodies). Techniques for generating bispecific antibodies from antibody fragments have been 
described in the literature. For example, bispecific antibodies can be prepared can be prepared using chemical 
linkage. Brcnnan et aL. Science 229:81 (1985) describe a procedure wherein intact antibodies are 

20 proteolyticaily cleaved to generate F(ab') 2 fragments. These fragments arc reduced in the presence of Ac 
dithiol complexing agent sodium arsenite to stabilize vicinal dithiols and prevent intermoiccular disulfide 
formation. The Fab* fragments generated are then convened to thionitrobenzoate (TNB) derivatives. One of 
the Fab*-"TNB derivatives is then reconvened to the Fab'-thioi by reduction with mercaptoethyiamine and is 
mixed' with an equimolar amount of the other Fab' -TNB denvative to form the bispecific antibody. The 

25 bispecific antibodies produced can be.uscd as agents for the selective immobilization of enzymes. 

Fab* fragments may be directly recovered from £. coli and chemically coupled to form bispecific 
antibodies. Shaiaby et a/.. J. Exp. Med. £75:217-225 (1992) desenbe the production of a rally humanized 
bispecific antibody F(ab*)> molecule. Each Fab' fragment was separately secreted from E. coli and subjected 
to directed chemical coupling in vitro to form the bispecific antibody. The bispecific antibody thus formed was 

30 able to bind to cells ovcrexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic 
activity of human cytotoxic lymphocytes against human breast rumor targets. 

Various technique for making and isolating bispecific antibody fragments directly from recombinant 
cell culture have also been described. For example, bispecific antibodies have been produced using leucine 
zippers. Kosteiny etaL.J. Immunol. J48 (5): 1547- 1 553 (1992). The leucine zipper peptides from the Fos and 

35 Jun proteins were linked to the Fab' portions of two different antibodies by gene fusion. The antibody 
homodimers were reduced at the hinge region to form monomers and then re-oxidized to form the antibody 
heterodimers. This method can also be utilized for the production of antibody homodimers. The "diabody" 
technology described by Hollinger et a/., Proc. NatL Acad. ScL USA 90:6444-6448 (1993) has provided an 
alternative mechanism for making bispecific antibody fragments. The fragments comprise a heavy-chain 

40 variable domain ( V H ) connected to a light-chain variable domain (V L ) by a linker which is too short to allow 
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pairing between the two domains on the same chain. Accordingly, the V„ and V L domains of one fragment 
are forced to pair with the complementary V L and V„ domains of another fragment, thereby forming two 
antigen-binding sites. Another strategy for making bispecific antibody fragments by the use of single-chain Fv 
(sFv) dimers has aJso been reported. See. Gruber et aL.J. Immunol. 152:5368 (1994). 

Antibodies with more than two valencies are contemplated. For example, trispecific antibodies can be 
prepared. Tun et aL, J. ImmunoL U7:60 ( 1991 ). 

Exemplary bispecific antibodies may bind to two different epitopes on a given PRO polypepide 
herein. Alternatively, an anti-PRO arm may be combined with an arm which binds to a triggering molecule on 
a leukocyte such as a T-celi receptor molecule (e.g.. CD2, CD3. CD28, or B7\ or Fc receptors for IgG (FcyR), 
such as FcyRI (CD64), FcyRJI (CD32) and FcvRIII (CD16) so as to focus cellular defense mechanisms to the 
cell expressing the particular PRO polypeptide. Bispecific antibodies may also be used to localize cytotoxic 
agents to cells which express a particular PRO polypeptide. These antibodies possess a PRO polypeptide - 
binding arm and an arm which binds a cytotoxic agent or a radionuclide chelator, such as EOTUBE, DPTA. 
DOTA. or TETA. Another bispecific antibody of interest binds the PRO polypeptide and further binds tissue 
factor (TF). 

5. Heteroconimiate Antibodies 

Heteroconjugate antibodies arc composed of two covalentiy joined antibodies. Such antibodies have, 
for example, been proposed to target immune system cells to unwanted cells [U.S. Patent No. 4,676.980]. and 
for treatment of HIV infection [WO 91/00360; WO 92/200373; EP 03089]. It is contemplated that the 
antibodies may be prepared in vitro using known methods in synthetic protein chemistry, including those 
involving crossiinking agents. For example, irnmunotoxins may be constructed using a disulfide exchange 
reaction or by forming a thioether bond Examples of suitable reagents for this purpose include iminothiolate 
and methyi-4-mercaptoburyrimidate and those disclosed, for example, in U.S. Patent No. 4.676.980. 

6. Effector function engineering 

It may be desirable to modify ihe antibody of the invention with respect to effector function, so as to 
enhance the effectiveness of the antibody in treating an immune related disease, for example. For example 
cysteine residuets) may be introduced in the Fc region, thereby allowing interchain disulfide bond formation in 
this region. The homodimeric antibody thus generated may have improved internalization capability and/or 
increased complement-mediated ceil killing and antibody-dependent cellular cytotoxicity (ADCC). See Caron 
et aL / Exp Med. F76:l I9MI95 (1992) and Shopes, B. 1 ImmunoL 148:2918-2922 (1992). Homodimeric 
antibodies with enhanced anti-tumor activity may also be prepared using heterobifunctional cross-linkers as 
described in Wolff et aL Cancer Research 53:2560-2565 (1993). Alternatively, an antibody can be engineered 
which has dual Fc regions and may thereby have enhanced complement lysis and ADCC capabilities. See 
Stevenson et aL. Anti-Cancer Drug Design y.2 19-230 (1989). 

7. rnununoconiueates 

The invention also pertains to immunoconjugates comprising an antibody conjugated to a cytotoxic 
agent such as a chemotherapeutic agent, toxin (e.g. an enzymatically active toxin of bacterial, fungal, plant or 
animal origin, or fragments thereof), or a radioactive isotope (i.e.. a radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugates have been described 
above. Enzymaticaily active toxins and fragments thereof which can be used include diphtheria A chain, 
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nonbinding active fragments of diphtheria toxin, exotoxin A chain (from Pseudomonas aeruginosa), ricin A 
chain, abrin A chain, modeccin A chain, alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytoiaca 
americana proteins (PAPI, PAP II. and PAP-S), momordica charantia inhibitor, curcin, crotuu sapaonaria 
officinalis inhibitor, gelonin. mitogeilin. restnctocin. phcnomycin, enomycin and the tricothecenes. A variety 
of radionuclides are available for the production of radioconjugated antibodies. Examples include 2,2 Bi t m I, 

Conjugates of the antibody and cytotoxic agent are made using a variety of bifunciionai protein 
coupling agents such as N-succmimidyl-3-(2-pyridyldithioi) propionate (SPDP). iminothiolane (IT), 
Afunctional derivatives of imidoesters (such as dimethyl adipimidate HCL), active esters (such as 
disuccinimidylsuberate). aldehydes (such as glutaraidehyde), bis-azido compounds (such as bis (p- 
azidobenzoyi) hexanediamine), bis-diazonium derivatives (such as bis-(p-diazoniumbenzoyl)- 
ethylenediamme), diisocyanates fsuch as toiyene 2.6-diisocyanate), and bis-active fluorine compounds (such as 
l,5-difluoro-2,4-dinitrobenzene). For example, a ricin immunotoxin can be prepared as described in Vitetta et 
aL . Science 238: 1098 (1987). Carbon- !4-labeled ^ l-isothiocyanatobenzyi-3-methyIdiethyiene 
mamtnepeniaacetic acid (MX-DTPA) is an exemplary cheiating agent tor conjugation of radionucieotide to the 
antibody. See W094/1 1026. 

In another embodiment, the antibody may be conjugated to a "receptor" (such streptavidin) for 
utilization in tissue pretargetmg wherein the antibody-receptor conjugate is administered to the patient, 
followed by removal of unbound conjugate from the circulation using a clearing agent and then administration 

*** - — •**• v W ».ju 6 uiCu kkj a wjriuiOAic ageru (e.g., u ruuiuuuuieuuuc). 

8. Immunoliposomes 

The proteins, antibodies, etc. disclosed herein may also be formulated as immunoliposomes. 
Liposomes containing the antibody arc prepared by methods known in the art. such as described in Epstein et 
aL Proc. NatL Acad. ScL U£4.J[2:3688 (1985): Hwang et aL Proc. Natl Acad Sci. US.4, 77:4030 (1980); 
and U.S. Pat. Nos. 4.485.045 and 4.544.545. Liposomes with enhanced circulation time are disclosed in U.S. 
Patent No. 5,013.556. 

Particularly useful liposomes can be generated by the reverse phase evaporation method with a lipid 
composition comprising phosphatidylcholine, cholesterol and PEG-derivatized phosphatidylethanolamine 
(PEG-PE). Liposomes are extruded through filters of defined pore size to yield liposomes with the desired 
diameter. Fab f fragments of the antibody of the present invention can be conjugated to the liposomes as 
described in Martin et aLJ. Bioi Chem. 257: 286-288 (1982) via a disulfide interchange reaction. A 
chemotherapeuuc agent (such as doxorubicin) may be optionally contained within the liposome. See Gabizon 
et aL J- National Cancer Inst. 8| (19) 1484 (1 989). 

M. Pharmaceutical Compositions 

The active PRO molecules of the invention (e.g„ PRO polypeptides. anti-PRO antibodies, and/or 
variants of each) as well as other molecules identified by the screening assays disclosed above, can , be 
administered for the treatment of immune related diseases* in the form of pharmaceutical compositions. 

Therapeutic formulations of the active PRO molecule, preferably a polypeptide or antibody of the 
invention, are prepared for storage by mixing the active molecule having the desired degree of purity with 
optional pharmaceutical^ acceptable carriers, excipients or stabilizers (Remington's Pharmaceutical Sciences 
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1 6th edition. Osoi, A. Ed. (1980]) ? in the form of lyophilized formulations or aqueous solutions. Acceptable 
carriers, excipients. or stabilizers are nontoxic to recipients at the dosages and concentrations employed, and 
include buffers such as phosphate, citrate, and other organic acids; antioxidants including ascorbic acid and 
methionine; preservatives (such as octadecyldimethylbenzyl ammonium chloride; hexamethonium chloride; 
benzaikonium chloride, benzethonium chloride; phenol, butyl or benzyl alcohol; alkyl parabens such as methyl 
or propyl paraben: catechol: resorcinol; cyciohexanol; 3-pentanol: and m-cresol): low molecular weight (less 
than about 10 residues) polypeptides; proteins, such as serum albumin, gelatin, or immunoglobulins; 
hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as glycine, glutamine. asparagine, 
histidine. arginine. or lysine; monosaccharides, disaccharides, and other carbohydrates including glucose, 
mannose. or dextrins; chelating agents such as EDTA; sugars such as sucrose, mannitoi. trehalose or sorbitol; 
sait-forming counter-ions such as sodium: metal complexes (e.g.. Zn-protein complexes): and/or non-ionic 
surfactants such as TWEEN™. PLURONICS™ or polyethylene glycol (PEG). 

Compounds identified by the screening assays disclosed herein can be formulated in an analogous 
manner, using standard techniques well known in the an. 

Lipofections or liposomes can also be used to deliver the PRO molecule into cells. Where antibody 
fragments arc used, the smallest inhibitory fragment which specifically binds to the binding domain of the 
target protein is preferred. For example, based upon the variable region sequences of an antibody, peptide 
molecules can be designed which retain the ability ro bind the target protein sequence. Such peptides can be 
synthesized chemically and/or produced by recombinant DNA technology (see, e.g.. Marasco et aL Proc. Natl. 
Acad. Sci. USA 90, 7889-7893 [1993j). 

The formulation herein may also contain more than one active compound as necessary for the 
particular indication being treated, preferably those with complementary activities that do not adversely affect 
each other. Alternatively, or in addition, the composition may comprise a cytotoxic agent, cytokine or growth 
inhibitory agent. Such molecules are suitably present in combination in amounts that are effective for the 
purpose intended. 

The active PRO molecules may also be entrapped in microcapsules prepared, for example, by 
coacervation techniques or by interfacial polymerization, for example, hydroxymethylcellulose or eeiatin- 
microcapsuics and poly-(methylmethacylate) microcapsules, respectively, in colloidal drug delivery systems 
(for example, liposomes, albumin microspheres, microemuisions. nano-particles and nanocapsules) or in 
macroemulsions. Such techniques are disclosed in Remington's Pharmaceutical Sciences 1 6th edition. Osol, 
A. Ed. (1980). 

The formulations to be used for in vivo administration must be sterile. This is readily accomplished 
by filtration through sterile filtration membranes. 

Sustained-release preparations or the PRO molecules may be prepared. Suitable examples of 
sustained-release preparations include semipermeable matrices of solid hydrophobic polymers containing the 
antibody, which matrices are in the form of shaped articles, e.g.. films, or microcapsules. Examples of 
sustained-release matrices include polyesters, hydrogels (for example, poly(2-hydroxyethyl.methacryiate), or 
poiy(vinylalcohol)), polylactides (U.S. Pat. No. 3,773,919), copolymers of L-glutamic acid and y-ethyl-L- 
glutamate, non-degradable ethylene-vinyi acetate, degradable lactic acid-glycolic acid copolymers such as the 
LUPRON DEPOT™ (injectable microspheres composed of lactic, acid-glycolic acid copolymer and leuprolide 
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acetate), and poiy-D-(-)-3-hydroxybutyric acid. While polymers such as cthyiene-vinyl acetate and lactic acid- 
giycolic acid enable release of molecules for over 100 days, certain hydrogels release proteins for shorter time 
periods. When encapsulated antibodies remain in the body for a long time, they may denature or aggregate as a 
result of exposure to moisture at 37°C t resulting in a loss of biological activity and possible changes in 
immunogeniciry. Rational strategies can be devised for stabilization depending on the mechanism involved. 
For example, if the aggregation mechanism is discovered to be intermolecular S-S bond formation through 
thio-disuifide interchange, stabilization may be achieved by modifying sulfhydryl residues, lyophilizing from 
acidic solutions, controlling moisture content, using appropriate additives, and developing specific polymer 
matrix compositions. 

N. Methods of Treatment 

It is contemplated that the polypeptides, antibodies and other active compounds of the present 
invention may be used to treat various immune related diseases and conditions, such as T cell mediated 
diseases, including those characterized by infiltration of inflammatory cells into a tissue, stimulation of T-cell 
proliferation, inhibition of T-ceil proliferation, increased or decreased vascular permeability or the inhibition 
thereof 

Exemplary conditions or disorders to be treated with the polypeptides, antibodies and other 
compounds of the invention, include, but are not limited to systemic lupus erythematosis. rheumatoid arthritis, 
juvenile chronic arthritis, osteoarthritis, spondyloarthropathies, systemic sclerosis (scleroderma), idiopathic 
inflammatory myopathies (dermatomyositis, polymyositis), Sjogren's syndrome, systemic vasculitis, 
sarcoidosis, autoimmune hemolytic anemia (immune pancytopenia, paroxysmal nocturnal hemoglobinuria), 
autoimmune thrombocytopenia (idiopathic thrombocytopenic purpura, immune-mediated thrombocytopenia), 
thyroiditis (Grave's disease. Hashimoto's thyroiditis, juvenile lymphocytic thyroiditis, atrophic thyroiditis), 
diabetes mellitus. immune-mediated renal disease (glomerulonephritis, tubuiointerstitial nephritis), 
demyelinating diseases, of the central and peripheral nervous systems such as multiple sclerosis, idiopathic 
demyelinating polyneuropathy or Guillain-Barre syndrome, and chronic inflammatory demyelinating 
polyneuropathy, hepatobiliary diseases such as infectious hepatitis (hepatitis A. B. C. D. E and other non- 
hepatbtropic viruses), autoimmune chronic active hepatitis, primary biliary cirrhosis, granulomatous hepatitis, 
and sclerosing cholangitis, inflammatory bowel disease (ulcerative colitis: Crohn's disease), gluten-sensitive 
enteropathy, and Whipple's disease, autoimmune or immune-mediated skin diseases including bullous skin 
diseases, erythema multiforme and contact dermatitis, psoriasis, allergic diseases such as asthma, allergic 
rhinitis, atopic dermatitis, food hypersensitivity and urticaria, immunologic diseases of the lung such as 
eosinophilic pneumonias, idiopathic pulmonary fibrosis and hypersensitivity pneumonitis* transplantation 
associated diseases including graft rejection and graft -versus-host-disease. 

In systemic lupus erythematosus, the central mediator of disease is the production of auto-reactive 
antibodies to self proteins/tissues and the subsequent generation of immune-mediated inflammation, antibodies 
either directly or indirectly mediate tissue injury. Though T lymphocytes have not been shown to be directly 
involved in tissue damage, T lymphocytes are required for the development of auto-reactive antibodies. The 
genesis of the disease is thus T lymphocyte dependent. Multiple organs and systems are affected clinically 
including kidney, lung, musculoskeletal system, mucocutaneous, eye, central nervous system, cardiovascular 
system, gastrointestinal tract, bone marrow and blood. 
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Rheumatoid arrhritis (RA) is a chronic systemic autoimmune inflammatory disease that mainly 
involves the synovial membrane of multiple joints with resultant injury to the articular cartilage. The 
pathogenesis is T lymphocyte dependent and is associated with the production of rheumatoid factors, auto- 
antibodies directed against self IgG. with the resultant formation of immune complexes that attain high levels 
in joint fluid and blood. These complexes in the joint may induce the marked infiltrate of lymphocytes and 
monocytes into the synovium and subsequent marked synovial changes; the joint space/fluid if infiltrated by 
similar cells with the addition of numerous neutrophils. Tissues affected are primarily the joints, often in 
symmetrical pattern. However, extra-anicular disease also occurs in two major forms. One form is the 
development of extra-anicular lesions with ongoing progressive joint disease and typical lesions of pulmonary 
fibrosis, vasculitis, and cutaneous ulcers. The second form of extra-anicular disease is the so called Felty's 
syndrome which occurs late in the RA disease course, sometimes after joint disease has become quiescent, and 
involves the presence of neutropenia, thrombocytopenia and splenomegaly. This can be accompanied by 
vasculitis in multiple organs with formations of infarcts, skin ulcers and gangrene. Patients often also develop 
rheumatoid nodules in the subcutis tissue overlying affected joints; the nodules late stage have necrotic centers 
surrounded by a mixed inflammatory ceil infiltrate. Other manifestations which can occur in RA include: 
pericarditis, pieuritis. coronary anerins. intestitial pneumonitis with pulmonary fibrosis, keratoconjunctivitis 
sicca, and rhematoid nodules. 

Juvenile chronic arthritis is a chronic idiopathic inflammatory disease which begins often at less than 
16 years of age. its phenorype has some similarities to RA; some patients which are rhematoid factor positive 
are classified as juvenile rheumatoid arthritis. The disease is sub-classified into three major categories: 
paucianicular. polyarticular, and systemic. The arthritis can be severe and is typically destructive and leads to 
joint ankylosis and retarded growth. Other manifestations can include chronic anterior uveitis and systemic 
amyloidosis. 

Spondyloarthropathies are a group of disorders with some common clinical features and the common 
association with the expression of HLA-B27 gene product. The disorders include: ankylosing sponyiitis. 
Reiter\s syndrome (reactive arthritis ». arthritis associated with inflammatory bowel disease, spondylitis 
associated with psoriasis, juvenile onset .spondyloarthropathy and undifferentiated spondyloarthropathy. 
Distinguishing features include sacroileitis with or without spondylitis; inflammatory asymmetric arthritis; 
association with HLA-B27 (a serologically defined allele of the HLA-B locus of class 1 MHC); ocular 
inflammation, and absence of autoantibodies associated with other rheumatoid disease. The cell most 
implicated as key to induction of the disease is the CD8+ T lymphocyte, a ceil which targets antigen presented 
by class I MHC molecules. CD8+ T cells may react against the class I MHC allele HLA-B 27 as if it were a 
foreign peptide expressed by MHC class 1 molecules. It has been hypothesized that an epitope of HLA-B 27 
may mimic a bacterial or other microbial antigenic epitope and thus induce a CD8+ T cells response. 

Systemic sclerosis (scleroderma) has an unknown etiology. A hallmark of the disease is induration of 
the skin; likely this is induced by an active inflammatory process. Scleroderma can be localized or systemic; 
vascular lesions are common and endothelial cell injury in the microvasculature is an early and important event 
in the development of systemic sclerosis; the vascular injury may be immune mediated. An immunologic basis 
is implied by the presence of mononuclear cell infiltrates in the cutaneous lesions and the presence of anu- 
nuciear antibodies in many patients. ICAM-1 is often upregulated on the ceil surface of fibroblasts in skin 
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lesions suggesting that T ceil interaction with these ceiis may have a role in the pathogenesis of the disease. 
Other organs involved include: the gastrointestinal tract: smooth muscle atrophy and fibrosis resulting in 
abnormal peristaisis/motility: kidney: concenrric subendothelial intimai proliferation affecting small arcuate 
and interlobular arteries with resultant reduced renal conical blood flow, results in proteinuria, azotemia and 
5 hypertension; skeletal muscle: atrophy, interstitial fibrosis; inflammation; lung: interstitial pneumonitis and 
interstitial fibrosis: and heart: contraction band necrosis, scarring/fibrosis. 

Idiopathic inflammatory myopathies including dermatomyositis, polymyositis and others are disorders 
of chronic muscle inflammation of unknown etiology resulting in muscle weakness. Muscle 
injury/inflammation is often symmetric and progressive. Autoantibodies are associated with most forms. 

10 These myositis-specific autoantibodies are directed against and inhibit the function of components, proteins 
and RNA's. involved in protein synthesis. 

Sjogren's syndrome is due to immune-mediated inflammation and subsequent functional destruction 
of the tear glands and salivary glands. The disease can be associated with or accompanied by inflammatory 
connective tissue diseases. The disease is associated with autoantibody production against Ro and La antigens. 

1 5 both of which are small RNA-protein complexes. Lesions result in keratoconjunctivitis sicca, xerostomia, with 
other manifestations or associations including bilary cirrhosis, peripheral or sensory neuropathy, and palpable 
purpura. 

Systemic vasculitis are diseases in which the primary lesion is inflammation and subsequent damage 
to blood vessels which results in ischemia/ necrosis/degeneration to tissues supplied by the affected vessels and 

20 eventual end-organ dysfunction in some cases. Vaxcuiitides can also occur as a secondary lesion or sequelae to 
other immune-infiammaiory mediated diseases such as rheumatoid arthritis, systemic sclerosis, etc.. 
particularly in diseases also associated with the formation of immune complexes. Diseases in the primary 
systemic vasculitis group include: systemic necrotizing vasculitis: polyarteritis nodosa, allergic angiitis and 
granulomatosis, polyangiitis: Wegener's granulomatosis; lymphomatoid granulomatosis; and giant cell arteritis. 

25 Miscellaneous vasculitides include: mucocutaneous lymph node syndrome (MLNS or Kawasaki's disease), 
isolated CNS vasculitis. Behet's disease, thromboangiitis obliterans (Buergers disease) and cutaneous 
necrotizing venuiitis. The pathogenic mechanism of most of the types of vasculitis listed is believed to be 
primarily due to the deposition of immunoglobulin complexes in the vessel wall and subsequent induction of 
an inflammatory response either via ADCC, complement activation, or both. 

30 Sarcoidosis is a condition of unknown etiology which is characterized by the presence of epithelioid 

granulomas in nearly any tissue in the body; involvement of the lung is most commoa The pathogenesis 
involves the persistence of activated macrophages and lymphoid cells at sites of the disease with subsequent 
chronic sequelae resultant from the release of locally and systemicaily active products released by these cell 
types. 

35 Autoimmune hemolytic anemia including autoimmune hemolytic anemia, immune pancytopenia, and 

paroxysmal nocrural hemoglobinuria is a result of production of antibodies that react with antigens expressed 
on the surface of red blood cells (and in some cases other blood cells including platelets as well) and is a 
reflection of the removal of those antibody coated ceils via complement mediated lysis and/or ADCC/Fc- 
receptor-mediaied mechanisms. 
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In autoimmune thrombocytopenia including thrombocytopenic purpura, and immune-mediated 
thrombocytopenia in other clinical settings, platelet destruction/removal occurs as a result of either antibody or 
complement attaching to platelets and subsequent removal by complement lysis, ADCC or FC-receptor 
mediated mechanisms. 

Thyroiditis including Grave's disease, Hashimoto's thyroiditis, juvenile lymphocytic thyroiditis, and 
atrophic thyroiditis, are the result of an autoimmune response against thyroid antigens with production of 
antibodies that react with proteins present in and often specific for the thyroid gland. Experimental models 
exist including spontaneous models: rats (BUF and BB rats) and chickens (obese chicken strain); inducible 
models: immunization of animals with either thyrogiobulin. thyroid microsomal antigen (thyroid peroxidase). 

Type I diabetes mcllitus or insulin-dependent diabetes is the autoimmune destruction of pancreatic 
islet (3 cells; this destruction is mediated by auto-antibodies and auto-reactive T cells. Antibodies to insulin or 
the insulin receptor can also produce the phenotype of insulin-non-responsiveness. 

Immune mediated renal diseases, including glomerulonephritis and cubulointerstitiai nephritis, are the* 
result of antibody or T lymphocyte mediated injury to renal tissue either directly as a result of the production of 
autoreactive antibodies or T cells against renal antigens or indirectly as a result of the deposition of antibodies 
and/or immune complexes in the kidney that are reactive against other non-renal antigens. Thus other 
immune-mediated diseases that result in the formation of immune-complexes can also induce immune 
mediated renal disease as an indirect sequelae. Both direct and indirect immune mechanisms result in 
inflammatory response that produces/ induces lesion development in renal tissues with resultant organ function 
impairment and in some cases progression to renal failure. Both humoral and cellular immune mechanisms can 
be involved in the pathogenesis of lesions. 

Demyelinating diseases of the central and peripheral nervous systems, including Multiple Sclerosis; 
idiopathic demyelinating polyneuropathy or Guillain-Barre syndrome; and Chronic Inflammatory 
Demyelinating Polyneuropathy, arc believed to have an autoimmune basis and result in nerve demyciination as 
a result of damage caused to oligodendrocytes or to myelin directly. In MS there is evidence to suggest that 
disease induction and progression is dependent on T lymphocytes. Multiple Sclerosis is a demyelinating 
disease that is T lymphocyte-dependent and has either a relapsing-remitting course or a chronic progressive 
course: The etiology is unknown; however, viral infections, geneuc predisposition, environment, and 
autoimmunity all contribute. Lesions contain infiltrates of predominantly T lymphocyte mediated, microglial 
ceils and infiltrating macrophages; CD4+T lymphocytes are the predominant cell type at lesions. The 
mechanism of oligodendrocyte cell death and subsequent demyelination is not known but is likely T 
lymphocyte driven. 

Inflammatory and Fibrotic Lung Disease, including Eosinophilic Pneumonias; Idiopathic Pulmonary 
Fibrosis, and Hypersensitivity Pneumonitis may involve a disregulated unmune-inflamrnatory response. 
Inhibition of that response would be of therapeutic benefit. 

Autoimmune or Immune-mediated Skin Disease including Bullous Skin Diseases, Erythema 
Multiforme, and Contact Dermatitis are mediated by auto-antibodies, the genesis of which is T lymphocyte- 
dependent 

Psoriasis is a T lymphocyte- mediated inflammatory disease. Lesions contain infiltrates of T 
lymphocytes, macrophages and antigen processing ceils, and some neutrophils. 
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Allergic diseases, including asthma; allergic rhinitis: atopic dermatiris: food hypersensitivity; and 
urticaria are T lymphocyte dependent. These diseases are predominantly mediated by T lymphocyte induced 
inflammation. IgE mediated- inflammation or a combination of both. 

Transplantation associated diseases, including Graft rejection and Graft-Versus-Host-Disease 
(GVHD) are T lymphocyte-dependent; inhibition of T lymphocyte function is ameliorative. 
Other diseases in which intervention of the immune and/or inflammatory response have benefit are infectious 
disease including but not- limited to viral infection (including but not limited to AIDS, hepatitis A, B, C, D, E 
and herpes) bacterial infection, fungal infections, and protozoal and parasitic infections (molecules (or 
derivatives/agonises) which stimulate the MLR can be utilized therapeutically to enhance the immune response 
to infectious agents), diseases of immunodeficiency (molecuies/derivatives/agonists) which stimulate the MLR 
can be utilized therapeutically to enhance the immune response for conditions of inherited, acquired, infectious 
induced (as in HIV infection), or iatrogenic (<.<?., as from chemotherapy) immunodeficiency), and neoplasia. 

It has been demonstrated that some human cancer patients develop an antibody and/or T lymphocyte 
response to antigens on neoplastic cells. It has also been shown in animal models of neoplasia that 
enhancement of the immune response can result in rejection or regression of that particular neoplasm. 
Molecules that enhance the T lymphocyte response in the MLR have utility in vivo in enhancing the immune 
response against neoplasia. Molecules which enhance the T lymphocyte proliferative response in the MLR (or 
small molecule agonists or antibodies that affected the same receptor in an agonistic fashion) can be used 
therapeutically to treat cancer. Molecules that inhibit the lymphocyte response in the MLR also function in 
vivo during neoplasia to suppress the immune response to a neoplasm; such molecules can either be expressed 
by the neoplastic cells themselves or their expression can be induced by the neoplasm in other cells. 
Antagonism of such inhibitory molecules (either with antibody, small molecule antagonists or other means) 
enhances immune- mediated tumor rejection. 

Additionally, inhibition of molecules with proinflammatory properties may have therapeutic benefit in 
rcperrusion injury; stroke; myocardial infarction; atherosclerosis; acute lung injury; hemorrhagic shock; bum; 
sepsis/septic shock; acute tubular necrosis: endometriosis; degenerative joint disease and pancreatis. 

The compounds of the present invention, e.g.. polypeptides or antibodies, are administered to a 
mammal, preferably a human, in accord with known methods, such as intravenous administration as a bolus or 
by continuous infusion over a period of time, by intramuscular, intraperitoneal intracerobrospinal, 
subcutaneous, intra-articuiar. intrasynovial. intrathecal, oral, topical, or inhalation (intranasal, intrapulmonary) 
routes. Intravenous or inhaled administration of polypeptides and antibodies is preferred. 

In immunoadjuvant therapy, other therapeutic regimens, such administration of an anti-cancer agent, 
may be combined with the administration of the proteins, antibodies or compounds of the instant invention. 
For example, the patient to be treated with a the immunoadjuvant of the invention may also receive an anti- 
cancer agent (chemotherapeutic agent) or radiation therapy. Preparation and dosing schedules for such 
chemotherapeutic agents may be used according to manufacturers' instructions or as determined empirically by 
the skilled practitioner. Preparation and dosing schedules for such chemotherapy are also described in 
Chemotherapy Service Ed., M.C. Perry, Williams & Wilkins, Baltimore, MD (1992). The chemotherapeutic 
agent may precede, or follow administration of the immunoadjuvant or may be given simultaneously therewith. 
Additionally, an anri-oestrogen compound such as tamoxifen or an anti-progesterone such as onapristone (see, 
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EP 6 1 68 1 2) may be given in dosages known for such molecules. 

It may be desirable to also administer antibodies against other immune disease associated or tumor 
associated antigens, such as antibodies which bind to CD20, CD I la. CD 18. ErbB2. EGFR, ErbB3. ErbB4, or 
vascular endothelial factor (VEGF). Alternatively, or in addition, rwo or more antibodies binding the same or 
5 rwo or more different antigens disclosed herein may be coadministered to the patient. Sometimes, it may be 
beneficial to also administer one or more cytokines to the patient. In one embodiment, the PRO polypeptides 
are coadministered with a growth inhibitory agent. For example, the growth inhibitory agent may be 
administered first, followed by a PRO polypeptide. However, simultaneous administration or administration 
first is also contemplated. Suitable dosages for the growth inhibitory agent are those presently used and may 
1 0 be lowered due to the combined action (synergy) of the growth inhibitory agent and the PRO polypeptide. 

For the treatment or reduction in the severity of immune related disease, the appropriate dosage of an 
a compound of the invention will depend on the type of disease to be treated, as defined above, the severity and 
course of the disease, whether the agent is administered for preventive or therapeutic purposes, previous 
therapy, the patient's clinical history and response to the compound, and the discretion of the attending 
1 5 physician. The compound is suitably administered to the patient at one time or over a series of treatments. 

For example, depending on the type and severity of the disease, about I ug/kg to 1 5 mg/kg (e.g., 0.1- 
20 mg/kg) of polypeptide or antibody is an initial candidate dosage for administration to the patient, whether, 
for example, by one or more separate administrations, or by continuous infusion. A typical daily dosage might 
range from about I Mg/kg to 100 mg/kg or more, depending on the factors mentioned above. For repeated 
20 administrations over several days or longer, depending on the condition, the treatment is sustained until a 
desired suppression of disease symptoms occurs. However, other dosage regimens may be useful. The 
progress of this therapy is easily monitored by conventional techniques and assays. 

O. Articles of Manufacture 

In another embodiment of the invention, an article of manufacture containing materials {e.g., 
25 comprising a PRO molecule) userui for the diagnosis or treatment of the disorders described above is provided. 
The article of manufacture comprises a container and an instruction. Suitable containers include, for example, 
bonles. vials, syringes, and test, tubes. The containers may be formed from a variety of materials such as glass 
or plastic. The container holds a composition which is effective for diagnosing or treating the condition and 
may have a sterile access port (for example the container may be an intravenous solution bag or a vial having a 
30 stopper pierccable by a hypodermic injection needle). The active agent in the composition is usually a 
polypeptide or an antibody of the invention. An instruction or label on, or associated with, the container 
indicates that the composition is used for diagnosing or treating the condition of choice. The article of 
manufacture may further comprise a second container comprising a pharmaceuticaily-acceptable buffer, such 
as phosphate-buffered saline. Ringer's solution and dextrose solution. It may further include other materials 
35 desirable from a commercial and user standpoint, including other buffers, diluents, filters, needles, syringes, 
and package inserts with instructions for use. 

P. Diagnosis and Prognosis of Immune Related Disease 

Cell surface proteins, such as proteins which are overexpressed in certain immune related diseases, are 
excellent targets for drug candidates or disease treatment The same proteins along with secreted proteins 
40 encoded by the genes amplified in immune related disease states find additional use in the diagnosis and 
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prognosis of these diseases. For example, antibodies directed against the protein products of genes amplified 
in multiple sclerosis, rheumatoid arthritis, or another immune related disease, can be used as diagnostics or 
prognostics. 

For example, antibodies, including antibody fragments, can be used to qualitatively or quantitatively 
detect the expression of proteins encoded by amplified or overcxpressed genes ("marker gene products"). The 
antibody preferably is equipped with a detectable, e.g.. fluorescent label, and binding can be monitored by light 
microscopy, flow cytometry, fluorimetry, or other techniques known in the art. These techniques are 
particularly suitable, if the overexpressed gene encodes a cell surface protein Such binding assays are 
performed essentially as described above. 

In situ, detection of antibody binding to the marker gene products can be performed, for example, by 
immunofluorescence or immunoelectron microscopy. For this purpose, a histological specimen is removed 
from the patient, and a labeled antibody is applied to it. preferably by overlaying the antibody on a biological 
sample. This, procedure also allows for determining the distribution of the marker gene product in the tissue 
examined. It will be apparent for those skilled in the an that a wide variety of histological methods are readily 
available for in situ detection. 

The following examples are offered lor illustrative purposes only, and are not intended to limit the 
scope of ihe present invention in any way. 

All patent and literature references cited in the present specification are hereby incorporated by 
reference in their entirety. 

EXAMPLES 

Commercially available reagents referred to in the examples were used according to manufacturer's 
instructions unless otherwise indicated. The source of those cells identified in the following examples, and 
throughout the specification, by ATCC accession numbers is the American Type Culture Collection. Manassas. 
VA. Unless otherwise noted, the present invention uses standard procedures of recombinant DNA technology, 
such as those described hereinabove and in the following textbooks: Sambrook et aL. Molecular Cloning: A 
Laboratory Manual. Cold Spring Harbor Press N.Y.. 1989; Ausubei et aL Current Protocols in Molecular 
Biology, Green Publishing Associates and Wiley interscience, N.Y., 1989; Innis et aL. PCR Protocols: A 
Guide to Methods and Applications. Academic Press, inc.. N.Y., 1990; Harlow et aL Antibodies: A Laboratory 
Manual. Cold Spring Harbor Press, Cold Spring Harbor. 1988; Gait, M.J.. Oligonucleotide Synthesis, IRL 
Press, Oxford, 1984; R.l. Freshney, Animal Cell Culture, 1987; Coligan et aL Current Protocols in 
Immunology, 199!. 

EXAMPLE 1 

Isolation of cDNA' clones Encoding Human PRO200, PRO204, PR0212, PR0216. PR0226, PRO240, 
PR0235. PR0245, PR0172, PR0273, PR0272, PR0332, PR0526, PRO701, PR036L PR0362, PR0363, 
PR0364, PR0356, PR0531, PR0533, PRO1083, PR0865, PRO770, PR0769, PR0788, PROl 114, PRO1007, 
PROl 184, PRO103I, PR01346, PROl 155, PRO1250, PR01312, PROl 192, PR01246. PROI283, PROl 195, 
PR01343, PR01418, PR01387, PRO1410, PR01917, PR01868, PRO205, PR021, PR0269, PR0344, 
PR0333. PR0381. PRO720, PR0866, PRO840, PR0982, PR0836, PROl 159, PR01358, PR01325, 
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PR01338. PR01434. PR04333. PRO4302. PRO4430 and PR05727 polypeptide. 

Various techniques were employed for isolating the cDNA clones described below. A general 
description of the methods employed follows immediately hereafter, whereas the details relating the specific 
5 sequences isolated is recited separately for each native sequence. It is understood that the actual sequences of 
the PRO polypeptides are those which are contained within or encoded by the clone deposited with the ATCC - 
and that in the in event of any discrepancy between the sequence deposited and the sequence disclosed herein, 
the sequence of the deposit is the true sequence 

10 ECD Homology: 

The extracellular domain (ECD) sequences (including the secretion signal sequence, if any) from 
about 950 known secreted proteins from the Swiss-Proc public database were used to search EST databases. 
The EST databases included public EST databases (e.g., GenBank), a private EST database (LIFESEQ* Incyte 
Pharmaceuticals. Palo Alto. CA), and proprietary ESTs from Genentech. The search was performed using the 

15 computer program BLAST or QLAST2 [Altschul et aL Methods in Enzymohgy. 266: 460-480 (1996)] as a 
comparison of the ECD protein sequences to a 6 frame translation of the EST sequences. Those comparisons 
resulting in a BLAST score of 70 (or in some cases. 90) or greater that did not encode known proteins were 
clustered and assembled into consensus DNA sequences with the program "phrap" (Phil Green. University of 
Washington. Seattle. Washington). 

20 Using various ESTs. drawing from both public and private databases, a consensus DNA sequence was 

assembled. Oligonucleotides were then synthesized to identify by PCR a cDNA library that contained the 
sequence of interest and for use as probes to isolate a clone encoding the particular native sequence PRO 
. polypeptide identified herein. 

In order to screen several libraries for a source of a full- length, native sequence clone. DNA from the 

25 libraries was screened by PCR amplification with the PCR primer pair identified below. A positive library was 
then used to isolate clones encoding the particular native sequence PRO polypeptide using the probe 
oligonucleotide and one of the PCR primers. 

RNA for construction of the cDNA libraries was isolated from various human tissue libraries, 
including, e.g.. fetal lung, fetal liver, fetal brain, small intestine, smooth muscle cells, etc. The cDNA libraries 

30 used to isolated the cDNA clones were constructed by standard methods using commercially available reagents 
such as those from Invitrogen, San Diego, CA. The cDNA was primed with oiigo dT containing a NotI site, 
linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately by gel electrophoresis, 
and cloned in a defined orientation into a suitable cloning vector (such as pRKB; pRK5B is a precursor of 
pRKLSD that does not contain the Sfil site; see. Holmes et aL Science, 253:1278-1280 (1991)) in the unique 

35 Xhol and NotI sites. The clones were sequenced using known and readily available methodology. 

Amylase yeast screen: 

1. Preparation of oligo dT primed cDNA library 

mRNA was isolated from various tissues (e.g„ such as those indicated above under the ECD homology 
40 procedure) using reagents and protocols from Invitrogen, San Diego, CA (Fast Track 2). This RNA was used to 
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generate an oligo dT primed cDNA library in the vector pRK5D using reagents and protocols from Life 
Technologies, Gaithersburg, MD (Super Script Plasmid System). In this procedure, the double stranded cDNA 
was sized to greater than 1000 bp and the Sail/NotI linkered cDNA was cloned into XhoI/NotI cleaved vector. 
pRK5D is a cloning vector that has an sp6 transcription initiation site followed by an SGI restriction enzyme 
site preceding the XhoI/NotI cDNA cloning sites. 

2. Preparation of random primed cDNA library 

A secondary cDNA library was generated in order to preferentially represent the 5 f ends of the 
primary cDNA clones. Sp6 RNA was generated from the primary library (described above), and this RNA was 
used to generate a random primed cDNA library in the vector pSST-AMY.O using reagents and protocols from 
Life Technologies (Super Script Plasmid System, referenced above). In this procedure the double stranded 
cDNA was sized to 500-1000 bp, linkered with blunt to NotI adaptors, cleaved with Sfil, and cloned into 
Sfil/Notl cleaved vector. pSST-AMY.O is a cloning vector that has a yeast alcohol dehydrogenase promoter 
preceding the cDNA cloning sites and the mouse amylase sequence (the mature sequence without the secretion 
signal) followed by the yeast alcohol dehydrogenase terminator, after the cloning sites. Thus. cDNAs cloned 
into this vector that are fused in frame with amylase sequence will lead to the secretion of amylase from 
appropriately transfected yeast colonies. 

3. Transformation and Detection 

DNA from the library described in paragraph 2 above was chilled on ice to which was added 
electrocompetent DH10B bacteria (Life Technologies, 20 ml). The bacteria and vector mixture was then 
electroporated as recommended by the manufacturer. Subsequently, SOC media (Life Technologies. 1 ml) was 
added and the mixture was incubated at 37°C for 30 minutes. The transformants were then plated onto 20 
standard 150 mm LB plates containing ampicillin and incubated for 16 hours (37°C). Positive colonies were 
scraped off the plates and the DNA was isolated from the bacterial pellet using standard protocols. Cisco- 
gradient. The purified DNA was then carried on to the yeast protocols below. 

The yeast methods were divided into three categories: (1) Transformation of yeast with the 
plasmid/cDNA combined vector, (2) Detection and isolation of yeast clones secreting amylase; and (3) PCR 
amplification of the insert directly from the yeast colony and purification of the DNA for sequencing and 
further analysis. 

The yeast strain used was HD56-5A (ATCC-90785). This strain has the following genotype: MAT 
alpha, ura3-52, leu2-3, Ieu2-U2, his3-ll, his3-l5, MAL', SUC\ GAL*. Preferably, yeast mutants can be 
employed that have deficient post- trans lationai pathways. Such mutants may have translocation deficient 
alleles in jec71, 5*c72. sec62. with truncated secl\ being most preferred. Alternatively, antagonists (including 
antisense nucleotides and/or ligands) which interfere with the normal operation of these genes, other proteins 
implicated in this post translation pathway (e.g.. SEC61p, SEC72p, SEC62p, SEC63p, TDJlp or SSAlp-4p) or 
the complex formation of these proteins may also be preferably employed in combination with the amylase- 
expressing yeast 

Transformation was performed based on the protocol outlined by Gietz et aL NucL Acid Res., 20: 1425 
(1992). Transformed cells were then inoculated from agar into YEPD complex media broth (100 ml) and 
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grown overnight at 30°C. The YEPD broth was prepared as described in Kaiser et ai.. Methods in Yeast 
Genetics, Cold Spring Harbor Press, Cold Spring Harbor. NY, p. 207 (1994). The overnight culture was then 
diluted to about 2 x 10 6 cells/ml (approx. OD^o = 0.1) into fresh YEPD broth (500 ml) and regrown to I x 10 7 
cells/ml (approx. OD<Hxr=0.4-0.5). 

The cells were then harvested and prepared for transformation by transfer into GS3 rotor bottles in a 
Sorval GS3 rotor at 5.000 rpm for 5 minutes, the supernatant discarded, and then resuspended into sterile 
water, and centrifuged again in 50 ml falcon tubes at 3,500 rpm in a Beckman GS-6KR centrifuge. The 
supernatant was discarded and the cells were subsequently washed with LiAc/TE (10 mi, 10 mM Tris-HCI, I 
mM EDTA pH 7.5, 100 mM Li20OCCH3). and resuspended into LiAc/TE (2.5 ml). 

Transformation look place by mixing the prepared cells (100 ul) with freshly denatured single 
stranded salmon testes DNA (Lofstrand Labs. Gaithersburg, MD) and transforming DNA (I ug, vol. < 10 ul) 
in microruge tubes. The mixture was mixed briefly by vortexing, then 40% PEG/TE (600 ul, 40% polyethylene 
glycol-4000. 10 mM Tris-HCI, I mM EDTA, 100 mM Li2Ac, pH 7.5) was added. This mixture was gently 
mixed and incubated at 30°C while agitating for 30 minutes. The ceils were then heat shocked at 42°C for 15 
minutes, and the reaction vessel centnruged in a microruge at 12.000 rpm for 5-10 seconds, decanted and 
resuspended into TE (500 ul. 10 mM Tris-HCI. I mM EDTA pH 7.5) followed by recenmrugauon. The cells 
were then diluted into TE (1 mi) and aliquots (200 ul) were spread onto the selective media previously 
prepared in 150 mm growth plates ( VWR). 

Alternatively, instead of multiple small reactions, the transformation was performed using a single, 
large scale reaction, wherein reagent amounts were scaled up accordingly. 

The selective media used was a synthetic complete dextrose agar lacking uracil (SCD-Ura) prepared as 
described in Kaiser et ai.. Methods in Yeast Genetics. Cold Spring Harbor Press. Cold Spring Harbor, NY, p. 
208-210 (1994). Transformants were grown at 30°C for 2-3 days. 

The detection of colonies secreting amylase was performed by including red starch in the selective 
growth media. Starch was coupled to the red dye (Reactive Red- 120. Sigma) as per the procedure described 
by Biely et aL Anal. Biochem.. 172:176-179 (1988). The coupled starch was incorporated into the SCD-Ura 
agar plates at a final concentration of 0. 15% (w/v), and was buffered with potassium phosphate to a pH of 7.0 
(50-100 mM Gnal concentration). 

The positive colonies were picked and streaked across fresh selective media (onto 150 mm plates) in 
order to obtain well isolated and identifiable single colonies. Well isolated single colonies positive for amylase 
secretion were detected by direct incorporation of red starch into buffered SCD-Ura agar. Positive colonies 
were determined by their ability to break down starch resulting in a clear halo around the positive colony 
visualized directly. 

Isolation and sequencing by standard techniques identified a yeast EST fragment which served as the 
basis for additional database mining as described below. 

4. Assembly 

The yeast EST fragment identified above was used to search various expressed sequence tag (EST ) 
databases. The EST databases included public EST databases (e.g.. GenBank, Merck/Wash U) and a 
proprietary EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA). The search was 
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performed using the computer program BLAST or BLAST2 (Altshul et aL, Methods in Enzymology 
266:460-480 (1996)) as a comparison of the ECD protein sequences to a 6 frame translation of the EST 
sequence. Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not 
encode known proteins were clustered and assembled into, consensus DNA sequences with the program 
"phrap" (Phil Green. University of Washington, Seattle, Washington). 

A consensus DNA sequence was assembled relative to other EST sequences using phrap. The 
consensus DNA sequence was extended using repeated cycles of BLAST and phrap to extend the consensus 
sequence as far as possible using the sources of EST sequences discussed above as well as EST sequences 
proprietary to Genentech. 

Based on this consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA 
library that contained the sequence of interest, and 2) for use as probes to isolate a clone encoding the 
particular PRO polypeptide. In order to screen several libraries for a full-length clone, DNA from the libraries 
was screened by PCR amplification, as per Ausube! et aL, Current Protocols in Molecular Biology, with the 
PCR primer pair. A positive library was then used to isolate clones encoding the gene of interest using the 
probe oligonucleotide and one of the primer pairs. 

RNA for construction of the cDNA libraries was isolated from various human tissues. The cDNA 
libraries used to isolate the cDNA clones were constructed by standard methods using commercially available 
reagents such as those from Invitrogen. San Diego, CA. The cDNA was primed with oligo dT containing a 
Notl site, linked with blunt to Sail hemikinased adaptors, cleaved with NotL sized appropriately by gel 
electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD: 
pRK5B is a precursor of pRK5D that does not contain the Sfil site; Holmes et aL Science, 253: 1278-1280 
(1991)) in the unique Xhol and Notl sites. 

Signal algorithm: 

A proprietary signal sequence finding algorithm developed by Genentech. Inc was used upon 
Expressed Sequence Tags (ESTs) and on clustered and assembled EST fragments from public (e.g.. GenBank) 
and/or private (Lifeseq\ Incyte Pharmaceuticals. Inc.. Palo Alto, CA) databases. The signal sequence 
algorithm computes a secretion signal score based on the character of the DNA nucleotides surrounding the 
first and optionally the second methionine codon(s) (ATG) at the 5*-end of the sequence or sequence fragment 
under consideration. The nucleotides following the first ATG must code for at least 35 unambiguous amino 
acids without any stop codons. If the first ATG has the required amino acids, the second is not examined. If 
neither meets the requirement the candidate sequence is not scored. In order to determine whether the EST 
sequence contains an authentic signal sequence, the DNA and corresponding amino acid sequences 
surrounding the ATG codon arc scored using a set of seven sensors (evaluation parameters) known to be 
associated with secretion signals. 

The above procedure resulted in the identification of EST sequences which were compared to a 
variety of expressed sequence tag (EST) databases which included public EST databases {e.g., GenBank) and a 
proprietary EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA). The homology search 
was performed using the computer program BLAST or BLAST2 (Altshul et aL, Methods in Enzymology 
266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater 
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that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the 
program *phrap" (Phil Green, University of Washington, Seattle, Washington). This resulted in the 
identification of additional EST sequences which either corresponded to full-length clones, which were 
examined and sequenced or served as a template for the creation of cloning oligonucleotides which were then 
used to screen various tissue libraries resulting in isolation of DNA encoding a native sequence PRO 
polypeptide. 

A. Isolation of cDNA clones Encoding Human PRO200 (UNO 174) 

Probes based on an expressed sequence tag (EST) identified from the Incyte Pharmaceuticals database 
due to homology with VEGF were used to screen a cDNA library derived from the human glioma cell line 
G61 . Screening may be conducted in a manner similar to the procedure disclosed elsewhere in this application. 
In particular, Incyte Clone "INC13025I6" was used to generate the following four probes: 
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5'-ACTTCTCAGTGTCCATAAGGG-3' (SEQ j D N0;3) 

5'-GAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTC-3' (SEQ ID NO:4) 

5'-CACCACAGCGTTTAACCAGG-3' (SEQIDNO:5) 
5 , -ACAACAGGCACAGTTCCCAC-3 , (SE q ro NO:6) 

Nine positives were identified and characterized. Three clones contained the full coding region and 
were identical in sequence. Partial clones were also identified from a fetal lung library and were identical with 
the glioma-derived sequence with the exception of one nucleotide change which did not alter the encoded 
amino acid. 

For mammalian protein expression, the entire open reading frame (ORF) was cloned into a 
CMV-based expression vector. An epitope-tag (FLAG. Kodak) and Histidine-tag (His8)- were inserted 
between the ORF and stop codon. UNO 1 74-His8 and UNO 1 74-FLAG were transfected into human embryonic 
kidney 293 cells by SuperFect (Qiagen) and pulse-labeled for 3 hours with ["Sjmethionine and [ w C]cysteine. 
Both epitope-tagged proteins co-migrate when 20 microliters of 15-fold concentrated serum-free conditioned 
medium were electrophoresed on a polyacrylamide gel (Novex) in sodium dodecyl sulfate sample buffer 
(SDS-PAGE). The UNQ174-IgG expression plasmid was constructed by cloning the ORF in front of the 
human Fc (IgG) sequence. 

The UNO 1 74-IgG plasmid was co-iransfected with Baculogold Baculovirus DNA (Pharmingen) using 
Lipofectin (GibcoBRL) into 10 s Sf9 cells grown in Hink's TNM-FH medium (JRH Biosciences) supplemented 
with 10% fetal bovine serum. Cells were incubated for 5 days at 28°C. The supernatant was harvested and 
subsequently used for the first viral amplification by infecting Sf9 cells at -an approximate multiplicity of 
infection (MOI) of 10. Cells were incubated for 3 days, then supernatant harvested, and expression of the 
recombinant plasmid determined by binding of I ml of supernatant to 30 ul of Protein-A Sepharose CL-4B 
beads (Pharmacia) followed by subsequent SDS-PAGE analysis. The first amplification supernatant was used 
to infect a 500 ml spinner culture of Sf9 cells grown in ESF-921 medium (Expression Systems LLC) at an 
approximate MOI of 0.1. Cells were treated as above, except harvested supernatant was sterile filtered. 
Specific protein was purified by binding to Protem-A Sepharose 4 Fast Flow (Pharmacia) column. 

The entire nucleotide sequence of the identified clone DNA29101 is shown, in Figure 1 (SEQ ID 
NO:l). Clone DNA29I01 (SEQ ID NO: I) contains a single open reading frame with an apparent translation 
initiation site at nucleotide residues 285-287 and ending at the stop codon (TAG) found at nucleotide positions 
1320-1322 (Figure 1, SEQ ID NO:l), as indicated by bolded underline. The predicted PRO200 polypeptide 
precursor (/.*.. UNQ174, SEQ ID NO:2) is 345 amino acids in length, has a calculated molecular weight of 
39029 daltons. a pi of 6.06 and is shown in Figure 2 (SEQ ID NO:2). Potential N-glycosylation sites are at 
amino acid residues 25, 54 and 254. CUB domains are at amino acid residues 52-65, 1 18- 125 and 260-273. 

A cDNA containing DNA encoding UNQ174 (SEQ ID NO:2) has been deposited with the ATCC on 
March 5, 1 998 and has been assigned deposit number 209653. 

B. Isolation of cDNA clones Encoding Human PRO204 (UNQ178) 

An expressed sequence tag (EST) DNA database (LIFESEQ", Incyte Pharmaceuticals, Palo Alto, CA) 
was searched and an EST was identified. Human fetal retina cDNA libraries were screened with PCR 
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oligonucleotide primers and con tinned by hybridization with synthetic oligonucleotide probe which was based 
upon the EST sequence. 
hybridization probe : 

S'-GGCATGCAGCAGCTGGACATTTGCGAGGGCTTTTGCTGGCTGO' (SEQ ID NO:7) 
5 forward PCR prim en 

S'-CTGCTGCAGAGTTGCACGAACO* (SEQ ID NO:8) 

reverse PCR primer 1 : 

5'-CAGTTGTTGTTGTCACAGAGAAG-3' (SEQ ID NO:9) 

reverse PCR primer 2 : 

10 S'-AGTTCGTGCAACTCTGCAGCAGO* (SEQIDNO:10) 

A cDNA clone was identified and sequenced in entirety. The entire nucleotide sequence of the 
identified clone DNA30871 is shown in Figure 3 (SEQ ID NO:l 1). Clone DNA30871-1 157 (SEQ ID NO:l I) 
contains a single open reading frame with an apparent translation initiation site at nucleotide positions 376-378 
and ending at the stop codon (TAA) found at nucleotide positions 1498-1500 (Figure 3: SEQ ID NO: 1 1), as 

15 indicated by bolded underline. The predicted PRO204 polypeptide precursor (i.e.. UNQ178. SEQ ID NO: 12) 
is 374 amino acids lone, has a calculated molecular weight of 39,285 daltons. a pi of 6.06 and is shown in 
Figure 4. A cDNA containing DNA encoding UNQ178 (SEQ ID NO: 12) has been deposited with the ATTC 
on October 16, 1997 and has been assigned deposit number 209380. 

20 C. Isolation of cDNA clones Encoding Human PRQ212 fUNO|86) 

Use of the ECD homology procedure described above from a human fetal lung library resulted in the 
identification of the full-length DNA sequence for DNA30942 (Fig. 5; SEQ ID NO: 13) and the derived protein 
sequence UNQ186 (Fig. 6; SEQ ID NO: 14). 

The PCR primers (forward and reverse) and probes used in the procedure were the following: 
25 forward primer: 5 , -CACGCTGGTTTCTGCTrGGAG-3' (SEQ ID NO: 15) 

reverse primer 5'- AGCTGGTGC AC AGGGTGTCATG-3' (SEQ ID NO: 1 6) 

hybridization probe: (SEQ ID NO: 1 7) 

5 , -CCCAGGCACCTTCTCAGCCAGCCAGCAGCTCCAGCTCAGAGCAGTGCCAGCCC-3 , 

The entire nucleotide sequence of DNA30942 is shown in Figure 5 (SEQ ID NO: 13). Clone 
30 DNA30942 (SEQ ID NO: 13) contains a single open reading frame with an apparent translation initiation site at 
nucleotide positions 101-103 and ending at the stop codon (TGA) at positions 1001-1003 (Fig. 5; SEQ ID 
NO: 13), as indicated in bolded underline. The predicted PR0212 polypeptide precursor of Fig. 6 (SEQ ID 
NO: 14) is 300 amino acids long, has a calculated molecular weight of 32680 daltons and a pi of 8.70. It is 
believed that the PR0212 sequence of Fig. 6 (SEQ ID NO: 14) lacks a transmembrane domain. It is also 
35 believed that amino acids 1 to 215 of Fig. 6 (SEQ ID NO: 14) represents an ECD which includes four cysteine 
rich domains (CRDs). A cDNA clone containing DNA30942 (SEQ ID NO: 13) has been deposited with ATCC 
(identified as DNA30942-1 134) on September 16, 1997 and has been assigned ATCC deposit no. 209254. 
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D. Isolation of cDNA clones Encoding Human PR0216 (UNO 190) 

A procedure analogous to the one above for the isolation of PR0212 can be employed to isolate 
DNA33087 (SEQ ID NO: 18) (Figure 7) which encodes the PR0216 polypeptide UNQ190 (SEQ D 
5 NO:I9)(Figure 8). 

DNA33087 contains a single open reading frame with an apparent translation initiation site nucleotide 
residues 268-270 and ending at the stop codon (TAG) are residues 1531-1533 (Fig. 7, SEQ ID NO: 18), as 
indicated by bolded underline. The predicted PR0215 polypeptide precursor (i.e.. UNQ190, SEQ ID NO:I9) 
is 421 amino acids long, has a calculated molecular weight of 49492 daltons and a pi of 5.51 (Fig. 8). 

10 Hydropathy analysis suggests the presence of a signal sequence at amino acid residues 1 to 20, 

tyrosine kinase phosphorylation sites at amino acid residues 268-274 and 300-306, and N-myristoyiation site 
residue 230-235, and leucine zippers at residues 146 to 167 and 217 to 238. Alternatively to traditional 
isolation techniques, the DNA sequence is publicly available from GenBank as accession number AB0001 14 
which encodes DayhofT protein AB0001 14_1 . 

15 Alternatively still, the sequence is described in Oh no et at.. Biochem. Biophys. Res. Commun, 228(2): 

411-414 (1996). A cDNA clone containing DNA33087 (identified as DNA33087-1 158) has been deposited 
with the American Type Culture Collection (ATCC) on September 16, 1997 and has been assigned ATCC 
Dep. No. 209381. 

20 E. Isolation of cDNA clones Encoding Human PR0226 (UNQ20O) 

Use of the ECD homology procedure described above in a human fetal lung library resulted in the 
identification of the full-length DNA sequence for DNA33460 (Figure 9; SEQ ID NO:20) and the derived 
native sequence protein UNQ200 (SEQ ID NO:21). 

DNA33460 contains a single open reading frame with an apparent translation initiation site at 
25 nucleotide residues 62-64 and ending at the stop codon (TGA) at residues 1391-1393 (Fig. 9; SEQ ID NO: 20), 
as indicated by bolded underline. The predicted PR0226 polypeptide precursor {i.e.. UNQ200. SEQ ID 
NO:21) is 443 amino acids long, has a calculated molecular weight of 49,391 daltons, a pi of 4.82 and is shown 
in Figure 10 as UNQ200 (SEQ ID NO:21). A cDNA clones containing DNA33460 (SEQ ID NO:20), 
designated as DNA33460- 1 1 66. has been deposited with the ATCC on October 16, 1997 and has been assigned 
30 ATCC deposit number 209376. 

The oligonucleotide sequences used in the above procedure were the following: 
28722.p(OLI488) (SEQ ID NO: 22) 

5 , -TGTGTGGACATAGACGAGTGCCGCTACCGCTACTGCCAGCACCGC-3 l 

28722.f(OLI489) (SEQ ID NO: 23) 

35 5'-AGGACTGCCATAACTTGCCTG-3' 

28722.r(OLI490) * (SEQ ID NO: 24) 

5 ! -ATAGGAGTTGAAGCAGCGCTGC-3 f 
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F. Isolation of cDNA clones Encoding Human PRO240 (UNQ214) 

Use of the ECD homology procedure described above in a human fetal liver library resulted in the 
isolation of the full-length DNA sequence for DNA34387 (Figure 11; SEQ ID NO:25) and the derived native 
sequence protein UNQ214 (SEQ ID NO:26). 

The entire nucleotide sequence of DNA34387 is shown in Figure 1 1 (SEQ ID NO:25). The clone 
DNA34387 contains a single open reading frame with an apparent translation initiation site at nucleotide 
positions 12-14 and ending at the stop codon (TGA) at nucleotide positions 699-701 (Fig. 1 1; SEQ ID NO:25), 
as indicated by bolded underline. The predicted PRO240 polypeptide precursor (/.&. UNQ2I4, SEQ ID 
NO:26) is 229 amino acids long, has a calculated molecular weight of 24.689 daitons. a pi of 7.83 and is shown 
in Figure 12. A cDNA clone containing DNA34387 (SEQ ID NO:25) has been deposited with ATCC on 
September 16, 1997 and is assigned ATCC deposit no. 209260. 

The PCR primers (forward and reverse) and hybridization probe synthesized for use in the above- 
described procedure were the following: 

forward PCR primer: S'-TCAGCTCCAGACTCTGATACTGCC-T (SEQ ID NO:27) 

reverse PCR pnmer: 5^TGCCTTTCTAGGAGGCAGAGCTCC-3' (SEQ ID NO:28) 

hybridization probe: (SEQ ID NO:29) 

5 , -GGACCCAGAAATGTGTCCTGAGAATGGATCTTGTGTACCTGATGGTCCAG-3* 

G. Isolation of cDNA clones Encoding Human PRQ235 (UNQ209) 

Use of the ECD homology procedure described above in a human fetal liver library resulted in the 
isolation of the full-length DNA sequence for DNA35558 (Figure 13; SEQ ID NO:30) and the derived 
PR0235 native sequence protein UNQ209 (Fig. 14, SEQ ID NO:3 1 ). 

The enure nucleotide sequence of DNA35558 is shown in Figure 13 (SEQ ID NO:30). The 
DNA35558 clone shown in Figure 13 contains a single open reading frame with an apparent translation 
initiation site at nucleotide positions 667-669 and ending at the stop codon (TGA) at nucleotide positions 2323- 
2325, as indicated by bolded underline. The predicted PR0235 polypeptide precursor {Le., UNQ209. SEQ ID 
NO:3 1 ) is 552 amino acids long, has a calculated molecular weight of 6 1 ,674 daitons and a pi of 6.95 (Figure 
14). A cDNA clone containing DNA35558 has been deposited with ATCC on October 16, 1997 and is 
assigned ATCC deposit no. 209374. 

The PCR primers (forward and reverse) and hybridization probe synthesized for use in the above 
procedure were: 

forward PCR primer: 5 , -TGGAATACCGCCTCCTGCAG-3' (SEQ ID NO:32) 

reverse PCR primer: y-CTTCTGCCCTTTGGAGAAGATGGC-y (SEQ ID NO:33) 

hybridization probe: 

5'-GGACTCACTGGCCCAGGCCTTCAATATCACCAGCCAGGACGAT-3' (SEQ ID NO:34) 
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H. 



Isolation of cDNA Clones Encoding Human PRQ245 (UNQ219) 



Use of the ECD homology procedure described above in a human fetal liver library resulted in the 
isolation of the full-length DNA sequence for DNA35658 (Figure 15, SEQ ID NO:35) and the derived PR0245 
native sequence protein UNQ2 1 9 (Figure 1 6, SEQ ID NO:36). 
5 The PCR primers (forward and reverse) and hybridization probes synthesized for use with the above- 

described method were the following: 

forward PCR primer 5'-ATCGTTGTGAAGTTAGTGCCCC-3' (SEQ ID NO:37) ' 

reverse PCR primer 5-ACCTGCGATATCCAACAGAATTG-3' (SEQ ID NO:38) 

hybridization probe (SEQ ID NO:39) 

10 5'-GGAAGAGGATACAGTCACTCTGGAAGTATTAGTGGCTCCAGCAGTTCC-3' 

The entire nucleotide sequence of DNA35638 (SEQ ID NO:35) is shown in Figure 15. Clone 
DNA35638 contains a single open 'reading frame with an apparent translation initiation site at nucleotide 
positions 89-91 and ending at the stop codon (TAG) at nucleotide positions 1025-1027 (Fig. 15; SEQ ID 
NO:35). The predicted PR0245 polypeptide precursor (U. t UNQ2I9, SEQ ID NO:36) is 312 amino acids 

15 long, has a calculated molecular weight of 34.554 daltons and a pi of 9.39 (Fig. 36). A clone containing 
DNA35638 (SEQ ID NO:35). designated as DNA35638-1 141, has been deposited with ATCC on September 
16. 1997 and is assigned ATCC deposit no. 209265. 

I. Isolation of cDNA clones Encoding Human PRQ172 (LTNQ146) 

20 Use of the ECD homology procedure described above Li a human fetal kidney library resulted in the 

isolation of the full-length DNA sequence for DNA35916 (Fig. 17; SEQ ID NO:40) and the derived PR0172 

native sequence protein UNQ146 (Fig. 18, SEQ ID NO:4l). 

Clone DNA35916 (SEQ ID NO:40) contains a single open reading frame with an apparent translation 

initiation site at nucleotide positions 38-40 and ending at the stop codon (TAA) at nucleotide positions 2207- 
25 2209. as indicated by bolded underline in Fig. 17. The predicted PR0172 polypeptide precursor (/.<?.. 

UNQ146; SEQ ID NO:4l) is 723 amino acids long, has a calculated molecular weight of 78.055 daltons and a 

pi of 6.17 (Fig. 18). A cDNA clone containing DNA35916 (SEQ ID NO:40) has been deposited with ATCC 

on October 28, 1997 (designated as DNA35916-1 161) and has been assigned ATCC deposit no. 209419. 



30 28765.p (OLI633) 

5 r -AAATCTGTGAATTGAGTG CC ATGGACCTGTTGCGGACGGCCCTTGCTT-3 t (SEQ ID NO:42) 
28765.f(OLI644) 



The oligonucleotide sequences used in the above procedure were the following: 



5'-GGATCTCGAGAACAGCTACTCC-3* 



(SEQ ID NO:43) 



28765.r (OLI645) 



35 



S'-TCGTCCACGTTGTCGTCACATGO* 



(SEQ ID NO:44) 
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J. Isolation of cDNA clones Encoding Human PRQ273 (UNQ240) ' 

Use of the ECD homology procedure described above in a human fetal kidney library resulted in the 
isolation of the full-length DNA sequence for DNA39523 (Fig. 19; SEQ ID NO:45) and the derived PR0273 
native sequence protein UNQ240 (Fig. 20, SEQ ID NO:46). 

The PCR primers (forward and reverse) and hybridization probe synthesized were the following: 
forward PCR primer: S'-CAGCGCCCTCCCCATGTCCCTG-T (SEQ ID NO:47) 

reverse PCR primer S'-TCCCAACTGGTTTGGAGTTnCCCO' (SEQIDNO:48) 
hybridization probe: 

5'-CTCCGGTCAGCATG AGGCTCCTGGCGGCCGCTGCTCCTGCTGCTG-3' (SEQ ID NO:49) 

Clone DNA39523 (SEQ ID NO:45) contains a single open reading frame with an apparent translation 
initiation site at nucleotide positions 167-169 and ending at the stop codon (TAG) at nucleotide positions 500- 
502 (Figure 19), as indicated by bolded underline. The predicted PR0273 polypeptide precursor (i.e., 
UNQ240, SEQ ID NO:46) is 1 1 1 amino acids long, has a calculated molecular weight of 13,078 daltons and a 
pi of 10.37 (Figure 20). A cDNA clone including DNA39523 (SEQ ID NO:45) has been deposited with 
ATCC on October 3 1. 1997 and is assigned ATCC deposit no. 209424. 

K. Isolation of cDNA clones Encoding Human PR0272 (UNQ239) 

Use of the ECD homology procedure described above in a human fetal lung tissue in combination 
with an in vivo cloning procedure using the probe oligonucleotide and one of the primer pairs resulted in the 
identification of the full length DNA sequence for DNA40620 (Fig. 2i, SEQ ID NO:50) and the derived 
PR0272 native sequence protein UNQ239 (SEQ ID NO:51). 

The forward and reverse PCR primers and hybridization probes synthesized and used to isolate the 
PR0272 encoding DNA sequences were the following: 

forward PCR primer (.fl ): 5'-CGC AGGCCCTCATGGCCAGGO* (SEQ ID NO:52) 

forward PCR primer (.f2): 5'-GAAATCCTGGGTAATTGG-r (SEQ ID NO:53) 

reverse PCR primer 5'-GTGCGCGGTGCTCACAGCTCATC-3' (SEQ ID NO:54) 

hybridization probe: 

5 T -CCCCCCTGAGCGACGCTCCCCCATGATGACGCCCACGGGAACTTC-3' (SEQ ID NO:55) 

Clone DNA40620 (SEQ ID NO:50) contains a single open reading frame with an apparent translation 
initiation site at nucleotide positions 35-37 and ending at the stop codon (TGA) at nucleotide positions 1020- 
1022 (Figure 21), as indicated by bolded underline. The predicted polypeptide precursor is 328 amino acids 
long (Figure 22), has a calculated molecular weight of 37,493 daltons and a pi of 4.77. A cDNA clone 
containing DNA40620 (SEQ ID NO:50) has been deposited with ATCC on October 17, 1997 and is assigned 
ATCC deposit no. 209388. 

L. Isolation of cDNA clones Encoding Human PRQ332 (UNQ293) 

Use of the ECD homology procedure described above in a human fetal liver library resulted 
in the identification of the full-length DNA sequence for DNA40982 (Fig. 23, SEQ ID NO:56) and the derived 
PR0332 native sequence protein UNQ293 (Fig. 24, SEQ ID NO:57). 
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The PCR primers (forward and reverse) and hybridization probe synthesized for use in the above 
procedure were: 

y-GCATTGGCCGCGAGACTTTGCC-T (SE q ID NO: 58) 

S'-GCGGCCACGGTCCTTGGAAATGO' (SE q ^ NO:59) 

S'-TGGAGGAGCTCAACCTCAGCTACAACCGCATCACCAGCCCACAGGO' (SEQ ID NO:60) 

The entire nucleotide sequence of DNA40982 (SEQ ID NO:56) is shown in Figure 23. Clone 
DNA40982 (SEQ ID NO:56) contains a single open reading frame with an apparent translation initiation site at 
nucleotide positions 342-344 and ending at the stop codon (TAG) at nucleotide positions 2268-2270, as 
indicated in Figure 23 by bolded underline. The predicted PR0332 polypeptide precursor (/.&. UNQ293, SEQ 
ID NO:57, Fig. 24) is 642 amino acids long, and has a calculated molecular weight of 72,067, and a pi of 6.60. 
A cDNA clone containing DNA40982 (SEQ ID NO:56) (designated as DNA40982-1235) has been deposited 
with ATCC on November 7, 1997 and is assigned ATCC deposit no. 209433. 

M - Isolation of cDNA clones Encodinc Human PRQ526 (UNQ330) 

Use of the ECD homology procedure described above in a human fetal liver library resulted in the 
identification of the full-length DNA sequence DNA44184 (Fig. 25, SEQ ID NO:61) and the derived PR0526 
native sequence protein UNQ330 (Fig. 26, SEQ ID NO:62). 

The PCR primers (forward and reverse) and hybridization probes synthesized were the following: 
forward PCR primer: y-TGGCTGCCCTGC ACT ACCTCTACC-3' (SEQ ID NO:63) 

reverse PCR primer 5'-CCCTGCAGGTCATTGGCAGCTAGG-3' (SEQ ID NO:64) 

hybridization probe: (SEQ ID NQ:65) 

5 , -AGGCACTGCCTGATGACACCTTCCGCGACCTGGGCAACCTCACAC-3 i . 

Clone DNA44I84 (SEQ ID NO:6l) contains a single open reading frame with an apparent translation 
initiation site at nucleotide positions 514-516 and ending at the stop codon (TGA) at nucleotide positions 1933- 
1935 (Figure 61), as indicated by bolded underline. The predicted PROS26 polypeptide precursor (i.e.. 
UNQ330, SEQ ID NO:62) is 473 amino acids long (Figure 62). The UNQ330 (SEQ ID NO:62) protein shown 
in Figure 62 has an estimated molecular weight of about 50708 daltons and a pi of about 9.28. A cDNA clone 
containing DNA44I84 has been deposited with the ATCC on 26 March 1998 (under the designation 
DNA44 184-1319) and is assigned deposit number 209704. 

Analysis of UNQ330 (SEQ ID NO:62) revels that the signal peptide sequence is at about amino acids 
1-26. A leucine zipper partem is at about amino acids 135-156. A giycosaminoglycan attachment is at about 
amino acids 436-439. N-glycosylation sites are at about amino acids 82-85, 179-182, 237-240 and 423-426. A 
von Willebrand factor (VWF) type C domain(s) is found at about amino acids 41 1-425. The skilled artisan 
can understand which nucleotides correspond to these amino acids based on the sequences provided herein. 

N- Isolation of cDNA clones Encoding Human PRO701 (UNQ365) 

Use of the ECD homology procedure described above in a human fetal liver library resulted in the 
identification of the full-length DNA sequence DNA44205 (Fig. 27, SEQ ID NO:66) and the derived PR0526 
native sequence protein UNQ3 65 (Fig. 28, SEQ ID NO:67). 



97 



SUBSTITUTE SHEET (RULE 26) 



WO 00/53758 



PCT/US00/05841 



The PCR primers (forward and reverse) and hybridization probe synthesized for use in the above 
procedure were: 

5'-GGCAAGCTACGGAAACGTCATCGTG-3' (SEQ ID NO:68) 

S'-AACCCCCGAGCCAAAAGATGGTCACO' (SEQ ID NO:69) 

5 S'-GTACCGGTGACCAGGCAGCAAAAGGCAACTATGGGCTCCTGGATCAGO' (SEQ ID NO:70) 

Clone DNA44205 (SEQ ID NO:66) contains a single open reading frame (with an apparent translation 

initiation site at nucleotide positions 50-52 and ending at the stop codon (TAG) at nucleotide positions 2498- 

3000, as indicated by bolded underline in Figure 27. The predicted PRO701 polypeptide precursor (/>. Fig. 

28, UNQ365, SEQ ID NO:67) is 816 amino acids long, and has a calculated molecular weight of 91/794 Da 
10 (pi: 5.88). A cDNA clone containing DNA44205 (SEQ ID NO:66) (designated as DNA44205-I285) has been 

deposited with ATCC on March 31, 1998 and is assigned ATCC deposit no. 209720. 

UNQ365 (SEQ ID NO:67) contains a potential signal peptide cleavage site at about amino acid 

position 25. There are potential N-giycosylation sites at about amino acid positions 83, 511, 716 and 803. The 

carboxyiesterases rype-B signature 2 sequence is at about residues 125 to 135. Regions homologous with 
15 carboxylesterase type-B are also at about residues 54-74. 197-212 and 221-261. A potential transmembrane 

region corresponds approximately to amino acids 671 through about 700. The corresponding nucleic acids can 

be routinely determined from the sequences provided herein. 



O. Isolation of cDNA clones Encoding Human PRQ361 fUNQ316) 
20 Usc of the ECD homology procedure described above in combination with an in vivo cloning 

procedure using the probe oligonucleotide and one of the primer pairs in a human fetal kidney library resulted 

in the identification of the full-length DNA sequence DNA45410 (Fig. 29. SEQ ID NO:71) and the derived 

PR0361 native sequence protein LTNQ316 (Fig. 30, SEQ ID NO:72). 

The forward and reverse PCR primers and a hybridization probe were synthesized for use in the 
25 above-described method: 

forward PCR primer (,fl):* 

5^AGGGAGGATTATCCTTGACCITTGAAGACC-3 ( (SEQ ID NO:73) 

forward PCR primer CD.): 5 l -GAAGCAAGTGCCCAGCTC-3 f (SEQ ID NO:74) 

forward PCR primer U3): ^-CGGGTCCCTGCTCTTTGGO' (SEQ ID NO:75) 

30 reverse PCR primer (.rl ): 5'-CACCGTAGCTGGGAGCGCACTCAC-3' (SEQ ID NO:76) 

reverse PCR primer (,r2): 5 , -AGTGTAAGTCAAGCTCCC-3 t (SEQ ID NO:77) 

hybridization probe : 

5'- GCTTCCTGACACTAAGGCTGTCTGCTAGTCAGAATTGCCTCAAAAAGAG-3' (SEQ ID NO:78) 
Clone DNA45410 (SEQ ID NO:71) contains a single open reading frame with an apparent translation 

35 initiation site at nucleotide positions 226-228 and ending at the stop codon (TAA) at nucleotide positions 1519- 
1521 (Figure 29), as indicated by bolded underline. The predicted PR0361 polypeptide precursor (i.e., 
UNQ316, SEQ ID NO:72) is 431 amino acids long (Figure 30). The native sequence PR0361 protein shown 
in Figure 30 as UNQ3 16 has an estimated molecular weight of about 468 10 and a pi of about 6.45. In addition, 
regions indicative of the arginase family proteins are present at about residues F3 to V14 and again at 139 to 

40 T57, while a transmembrane domain exists at about residues P380 to S409. A cDNA clone containing 
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DNA45410 (SEQ ID NO:71) has been deposited with ATCC on February 5. 1998 and is assigned ATCC 
deposit no. 209621. 



P- Isolation of cDNA clones Encoding Human PRQ362 (UNQ317) 

Use of the ECD homology procedure described above in a human fetal brain library resulted in the 
isolation of the full-length DNA sequence DNA454I6 (Fig. 31, SEQ ID NO:79) and the derived PR0362 
10 native sequence protein UNQ3 17 (Fig. 32, SEQ ID NO:80). 

The PCR primers (forward and reverse) and hybridization probe synthesized for use in the above 
procedure were: 

forward PCR primer 1: ^TATCCCTCCAATTGAGCACCCTGG^ (SEQlDNO:81) 

forward PCR primer 2: 5 f -GTCGGAAGACATCCCAACAAG-3' (SEQ ID NO:82) 

1 5 reverse PCR primer 1 : 5'-CTTC ACAATGTCGCTGTGCTGCTC-3* (SEQ ID NO:83) 

reverse PCR primer 2: 5'-AGCCAAATCCAGCAGCTGGCTTAC-3' (SEQ ID NO:84) 

hybridization probe : 

5 , -TGGATGACCGGAGCCACTACACGTGTGAAGTCACCTGGCAGACTCCTGAT-3* (SEQ ID NO:85) 

Clone DNA45416 (SEQ ID NO:79) contams a single open reading frame with an apparent translation 

20 initiation site at nucleotide positions 1 19-121 and ending at the stop codon (TAA) at nucleotide positions 1082- 
1084 (Figure 31), as indicated by bolded underline. The predicted PR0362 polypeptide precursor (i.e., 
UNQ317, SEQ ID NO:80) is 321 amino acids long (Figure 32). The UNQ317 protein (SEQ ID NO:80) shown 
in Figure 32 has an estimated molecular weight of about 35,544 daitons and a pi of about 8.5 1 . Analysis of the 
UNQ3I7 polypeptide as shown in Figure 32 evidences the presence of a glycosaminoglycan attachment site at 

25 about amino acid 149 to about amino acid 152 and a transmembrane domain from about amino acid 276 to 
about amino acid 306. A cDNA clone containing DNA45416 (SEQ ID NO:79) has been deposited with ATCC 
on February 5, 1998 and is assigned ATCC deposit no. 209620. 

Q. Isolation of cDNA clones Encoding Human PRQ363 (UNQ318) 
30 Use of the ECD homology described above in a human fetal kidney library resulted in the isolation of 

the full-length DNA sequence DNA45419 (Fig. 33, SEQ ID NO:86) and the derived PR0363 native sequence 
protein UNQ3 18 (Fig. 34, SEQ ID NO:87). 

The PCR primers (forward and reverse) and hybridization probe synthesized for use in the above 
procedure were: 
35 forward PCR primer 

5-CCAGTGCACAGCAGGCAACGAAGC-3* (SEQ ID NO:88) 
reverse PCR primer 

S'-ACTAGGCTGTATGCCTGGGTGGGCO' (SEQ ID NO:89) 
hybridization probe : 

40 S'-GTATGTACAAAGCATCGGCATGGTTGCAGGAGCAGTGACAGGCO' (SEQ ID NO:90) 
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Clone DNA45419 (SEQ ID NO:86) contains a single open reading frame with an apparent translation 
initiation site at nucleotide positions 190-192 and ending at the stop codon (TGA) at nucleotide positions 1309- 
1311 (Figure 33), as indicated by bolded underline. The predicted PR0363 polypeptide precursor (i.e., 
UNQ318, SEQ ID NO:87) is 373 amino acids long (Figure 34). The UNQ3I8 protein (SEQ ID NO:87) shown 
in Figure 34 has an estimated molecular weight of about 41,281 daltons and a pi of about 8.33. Analysis of the 
UNQ318 polypeptide as shown in Figure 34 evidences the presence of a transmembrane domain at about 
amino acid residue 221 to about residue 254. A cDNA clone containing DNA45419 (SEQ ID NO:86) has been 
deposited with ATCC on February 5, 1998 and is assigned ATCC deposit no. 209616. 

R. Isolation of cDNA clones Encoding Human PRQ364 (UNQ319) 

Use of the ECD homology procedure described above in a human small intestine library resulted in 
the identification of an expressed sequence tag (EST) (Incyte EST No. 3003460) that encoded a polypeptide 
which showed homology to members of the tumor necrosis factor receptor (TNFR) family of polypeptides. 

A consensus DNA sequence was then assembled relative to the Incyte 3003460 EST in a manner 
similar to that used in the ECD homology procedure which resulted in the isolation of the full-length DNA 
sequence DNA47365 (Fig. 35. SEQ ID NO:91) and the derived PR0364 native sequence protein UNQ3I9 
(Fig. 36, SEQ ID NO:92). 

The PCR primers (forward and reverse) and hybridization probes synthesized for use in the above- 
described screening procedure were: 

forward PCR primer (44825.fl) : 5'-CACAGCACGGGGCGATGGG-3* (SEQ !D NO:93) 

forward PCR primer (448 25. f2) : 5'-GCTCTGCGTTCTGCTCTG-3' (SEQ ID NO:94) 

forward PCR primer (44825.GITR.n : 

5'-GGCACAGCACGGGGCGATGGGCGCGTTT-3 , (SEQ ID NO:95) 

reverse PCR primer (44825.rl) : 5 , -CTGGTCACTGCCACCTTCCTGCAC-3 , (SEQ ID NO:96) 

reverse PCR primer (44825.r2) : 5'-CGCTGACCCAGGCTGAG-3' (SEQ ID NO:97) 
reverse PCR primer (44825.GITR.r) : 

5'-GAAGGTCCCCGAGGCACAGTCGATACA-3' (SEQ ID NO:98) 

hybridization probe (44825.pl ) : 

5 l -GAGGAGTGCTGTTCCGAGTGGGACTGCATGTGTGTCCAGC-3 t (SEQ ID NO:99) ' 

hybridization probe (44825.GITR.p) : 

5'-AGCCTGGGTCAGCGCCCCACCGGGGGTCCCGGGTGCGGCC-3' (SEQ ID NO: 100) 

Clone DNA47365 (SEQ ID NO:9l) contains a single open reading frame with an apparent translation 
initiation site at nucleotide positions 121-123 and ending at the stop codon (TGA) at nucleotide positions 844- 
846 (Figure 35). as indicated by bolded underline. The predicted PR0364 polypeptide precursor (Le., 
UNQ3 19, SEQ ID NO:92) is 241 amino acids long (Figure 36). The UNQ319 (SEQ ID NO:92) protein shown 
in Figure 36 has an estimated molecular weight of about 26,000 daltons and a pi of about 6.34. A potential N- 
glycosylation sites exists between amino acids 146 and 149 of the amino acid sequence shown in Figure 36. A 
putative signal sequence is from amino acids I to 25 and a potential transmembrane domain exists between 
amino acids 162 to 180 of the sequence shown in Figure 36. A cDNA clone containing DNA47365 
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(designated DNA47365-1206) has been deposited with ATCC on November 7. 1997 and is assigned ATCC 
Deposit No. ATCC 209436. 

S. Isolation of c DNA clones Encoding Human PRQ356 (UNQ313)(NL4) 

An expressed sequence tag (EST) DNA database (UFESEQ*. Incyte Pharmaceuticals, Palo Alto, CA) 
was searched and an EST (#2939340) was identified which showed homology to human TIE-2 LI and TIE-2 
L2. 

Based on the EST, a pair of PCR primers (forward and reverse), and a probe were synthesized: 
NL4.5- 1 : 5'-TTCAGCACCAAGGACAAGGACAATGACAACT-3' (SEQ ID NO: 103) 

NL4.3- 1 : 5'-TGTGCACACTTGTCCAAGCAGTTGTCATTGTC-3' (SEQ ID NO: 104) 

NL4.3-3: 5'-GTAGTACACTCCATTGAGGTTGG-3' (SEQ ID NO: 105). 

Oligo dT primed cDNA libraries were prepared from uterus mRNA purchased from Clontech, Inc. 
(Palo Alto. CA, USA, catalog # 6537-1) in the vector pRKSD using reagents and protocols from Life 
Technologies, Gauhersburg, MD (Super Script Plasmid System). pRK5D is a cloning vector that has an s P 6 
transcription initiation site followed by an Sfil restriction enzyme site preceding the Xhol/Notl cDNA cloning 
sites. The cDNA was primed with oligo dT containing a NotI site, linked with blunt to Sail hemikinased 
adaptors, cleaved with NotI. sized to greater than 1000 bp appropriately by gel electrophoresis, and cloned in a 
defined orientation into XhoI/Notl-cleaved pRK5D. 

In order to screen several libraries for a source of a full-length clone. DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0356 gene using the probe oligonucleotide and one of the PCR primers. 

DNA sequencing of the clones isolated as described above gave a full-length DNA sequence encoding 
the native sequence PR0356 (NL4) (i.e., DNA47470, SEQ ID NO: 101) and the derived PR0356 protein 
sequence UNQ3 1 3 (SEQ ID NO: 1 02). 

The entire nucleotide sequence of DNA47470 is shown in Figure 37 (SEQ ID NO: 101). Clone 
DNA47470 (SEQ ID NO:101) contains a single open reading frame with an apparent translation initiation site 
at nucleotide positions 215-217, and a TAA stop codon at nucleotide positions 1038-1040.,as indicated by 
bolded underline. The predicted PR0356 polypeptide is 346 amino acids long (i.c. UNQ313 (SEQ ID 
NO:102), has a calculated molecular weight of 40,018 daltons and a pi of 8.19. A cDNA clone containing 
DNA47470 (SEQ ID NO:101) has been deposited with ATCC on October 28, 1997 and is assigned ATCC 
deposit no. 209422. 

T - Isolation of cDNA clones Encodinc Human PROS31 (UNQ332) 

Use of the ECD homology procedure identified above in a human fetal brain library resulted in the 
isolation of the full-length DNA sequence DNA48314 (Fig. 39, SEQ ID NO: 106) and the derived PR0531 
native sequence protein UNQ332 (Fig. 40, SEQ ID NO: 107) . 

The PCR primers (forward and reverse) and hybridization probe synthesized were: 
forward PCR primer. 5'-CTGAGAACGCGCCTGAAACTGTG-3' (SEQ ID NO: 108) 

reverse PCR primer S'-AGCGTTGTCATTGACATCGGCG-S' (SEQ ID NO: 109) 
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hybridization probe : (SEQ ID NO: 1 10) 

S'-TTAGTTGCTCCATTCAGGAGGATCTACCCXr^ 

Clone DNA48314 (SEQ ID NO: 106) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 171-173 and ending at the stop codon (TGA) at nucleotide 
positions 2565-2567 (Figure 39), as indicated by boldcd underline. The predicted PR0531 polypeptide 
precursor (i.e., UNQ332, SEQ ID NO: 107) is 789 amino acids long. The UNQ332 protein (SEQ ID NO: 107) 
shown in Figure 39 has an estimated molecular weight of about 87552 daltons and a pi of about 4.84. A clone 
containing DNA48314 (SEQ ID NO: 106) has been deposited with the ATCC on 26 March 1998, and has been 
assigned deposit number 209702. 

Analysis of the UNQ332 amino acid sequence of SEQ ID NO: 107 reveals a cadherin extracellular 
repeated domain signature at about amino acids 122-132, 231-241, 336-346, 439-449 and 549-559. An 
ATP/GTP -binding site motif A (P-loop) is found at about amino acids 285-292 of SEQ ID NO: 107. N- 
giycosylation sites are found at least at about amino acids 567-570, 786-790, 418-421 and 336-339, the signal 
peptide is at about amino acids 1-26, and the transmembrane domain is at about amino acids 685-712 of SEQ 
IDNO:I07. 

U. Isolation of cDNA clones Encoding Human PRQ533 (UNQ334) 

The EST sequence accession number AF007268, a munne fibroblast growth factor (FGF-I5) was 
used to search various public EST databases {e.g., GenBank. Dayhoff, etc.). The search was performed using 
the computer program BLAST or BLAST2 [Altschul et ai. Methods in Enzymology, 266:460-480 (1996)] as a 
comparison of the ECD protein sequences to a 6 frame translation of the EST sequences. The search resulted 
in the identification of GenBank EST AA220994, which has been identified as stratagene NT2 neuronal 
precursor 937230. 

Based on this sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that 
contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding 
sequence. In order to screen several libraries for a source of a full-length clone. DNA from the libraries was 
screened by PCR amplification, as per Ausubel et aL Current Protocols in Molecular Biology, with the PCR 
primer pair. A positive library was then used to isolate clones encoding the PR0533 gene of interest by an in 
vivo cloning procedure using the probe oligonucleotide and one of the PCR primers. 

RNA for construction of the cDNA libraries was isolated from human fetal retina. The cDNA 
libraries used to isolated the cDNA clones were constructed by standard methods using commercially available 
reagents (e.g., Invitrogen, San Diego, CA; Clontech, etc.) The cDNA was primed with oligo dT containing a 
NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately by gel 
electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD; 
pRKSB is a precursor of pRK5D that does not contain the Sfil site; Holmes et a!., Science. 253: 1278-1280 
( 1 99 1 )) in the unique Xhol and NotI sites. 

A cDNA clone was sequenced in its entirety. The full length nucleotide sequence DNA49435 (SEQ 
ID NO: 1 1 1) is shown in Figure 41. Clone DNA49435 (SEQ ID NO: 11 1) contains a single open reading frame 
with an apparent translation initiation site at nucleotide positions 464-466 and ending at the stop codon (TAA) 
at nucleotide positions 649-651, as indicated by bolded underline in Fig. 41. The predicted PR0533 

102 



SUBSTITUTE SHEET (RULE 26) 



WO 00/53758 



PCT/US00/05841 



polypeptide precursor (z.e, UNQ334, SEQ ID NO: 112) is 216 amino acids long, has a calculated molecular 
weight of 24,003 daltons and a pi of 6.99. Clone DNA49435-1219 has been deposited with ATCC (under the 
designation DNA49435-1219) on November 21, 1997 and is assigned ATCC deposit no. 209480. 

The oligonucleotide sequences used in the above procedure were the following: 
FGFlS.f: 5'-ATCCGCCCAGATGGCTACAATGTGTA-3' (SEQ ID NO:l 13) 

FGFlS.p: 5 r -GCCTCCCGGTCTCCCTGAGCAGTGCCAAACAGCGGCAGTGTA-3* (SEQ ID NO: 1 14) 

FGF15.H 5'-CCAGTCCGGTGACAAGCCCAAA-3' (SEO ID NO:l 15) 



v * Isolation of cDNA clones Encoding Human PRO 1083 (UNQ540) 

Use of the amylase yeast screen procedure described above on tissue isolated &om human fetal 
kidney tissue resulted in an EST sequence which served as the template for the creation of the 
oligonucleotides below and screening as described above in a human fetal kidney library resulted in the 
isolation of the full length DNA sequence DNA50921 (Fig. 43, SEQ ID NO: 1 16) and the derived PRO 1083 
native sequence protein UNQ540 (SEQ ID NO: 1 17). 

The PCR primers (forward and reverse) and hybridization probes synthesized for use in the above 
procedure were the following: 

forward primer (43422.fl ): S'-GGCATTGG AGC AGTGCTGGGTG-3' (SEQ ID NO: 1 1 8) 

forward primer (43422.f2); 5'-AGAGCAACTCAGACAGCG-3* (SEQ ID NO: i 19) 

reverse primer (43422.rl): 5'-TGGAGGCCTAGATGCGGCTGGACG-3' (SEQ ID NO:120) 

reverse primer (43422.r2): 5'-CG AGG AG ACCATC AGCAC-3* (SEQ ID NO: 121) 

hybridization probe: (43422.pl): (SEQ j£> NO: 122) 

5 f -CCCAAACATCCTGCTTCTGCAACCACTTGACCTACTTrGCAGTGC-3' 

Clone DNA5092I (SEQ ID NO: 11 6) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 154-156 and ending at the stop codon (TAG) at nucleotide 
positions 2233-2235 (Figure 43). as indicated by bolded underline. The predicted PRO 1083 polypeptide 
precursor (i.e., UNQ540, SEQ ID NO: 117, Figure 44) is 693 amino acids long. The UNQ540 (SEQ ID 
NO: 11 7) protein shown in Figure 44 has an estimated molecular weight of about 77738 and a pi of about 
8.87. A clone containing DNA5092I has been deposited with the ATCC on May 12, 1998 and has been 
assigned deposit number 209859. 

Analysis of the amino acid sequence UNQ540 (SEQ ID NO:l 17) reveals the putative signal peptide is 
at about amino acids 1-25, transmembrane domains are at about amino acids 382-398, 402-420, 445-468, 473- 
491, 519-537, 568-590 and 634-657, a microbodies C-terminal targeting signal at about amino acids 691-693, 
cAMP- and cGMP-dependent protein kinase phosphorylation sites at about amino acids 198-201 and 370-373, 
N-glycosylation sites at about amino acids 39-42, 148-151, 171-174, 234-237, 303-306. 324-227 and 341-344 
and a G-protein coupled receptor family domain at about amino acids 475-504. 

W. Isolation of cDNA clones Encoding Human PRQ865 (UNQ434) 
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Use of the amylase yeast screen procedure described above on tissue isolated from human fetal kidney 
tissue resulted in an EST sequence which served as the template for the creation of the oligonucleotides below 
and screening as described above in a human fetal kidney library resulted in the isolation of the full length 
DNA sequence DNA53974 (Fig. 45, SEQ ID NO: 123) and the derived PR0865 native sequence protein 
5 UNQ434(SEQIDNO:I24). 

The PCR primers (forward and reverse) and hybridization probes synthesized for use in the above 
procedure were the following: 

forward primer (48615.fl): 5-AAGCTGCCGGAGCTGCAATG-3' (SEQ ID NO:125) 

forward primer (486t5.f2): S'-TTGCTTCTTAATCCTGAGCGCO' (SEQ ID NO: 126) 

10 forward primer (48615.B): 5 t -AAAGGAGGACTTTCGACTGC-3 , (SEQ ID NO: 127) 

reverse primer (48615,rl): 5'-AGAGATTCATCCACTGCTCCAAGTCG-3' (SEQ ID NO: 128) 

reverse primer: (486 15.r2): 5 -TGTCCAGAAACAGGCACATATCAGC-3' (SEQ ID NO:129) 

hybridization probe: (43422.pl): (SEQ ID NO: 130) 

S'-AGACAGCGGCACAGAGGTGCTTCTGCCAGGTTAGTGGTTACTTGGATGATO' 

l 5 Clone DNA53974 (SEQ ID NO:123) contains a single open reading frame with an apparent 

translation initiation site at nucleotide positions 173-175 and ending at the stop codon (TAA) at nucleotide 
positions 1577-1579 (Figure 45). as indicated by bolded underline. The predicted PR0865 polypeptide 
precursor (i.e.. UNQ865, SEQ ID NO: 124) is 468 amino acids long. The UNQ434 (SEQ ID NO: 124) protein 
shown in Figure 46 has an estimated molecular weight of about 54,393 and a pi of about 5.63. A clone 

20 containing DNA53974 (SEQ ID NO:I23) has been deposited with the ATCC on April 14, 1998 and has been 
assigned deposit number 209774. 

Analysis of the amino acid sequence UNQ434 (SEQ ID NO: 124) reveals the putative signal peptide at 
about amino acid residues 1-23, potential N-glycosyiacion sites at about amino acids residue 280 and at about 
384, a potential amidation site from about amino acid residue 94 to about residue 97. glycosaminoglycan 

25 attachment sites from about ammo acid residue 20 to about 23 and from about residue 223 to about residue 
226. an aminotransferase class-V pyridoxyl- phosphate ammo acid sequence block from about amino acid 
residue 216 to about residue 222 and an amino acid sequence block similar to that found in the interleukin-7 
. protein from about amino acid residue 338 to about residue 343. 

30 X. Isolation of cDNA clones Encodinc Human PRO770 (UNQ408) 

A public expressed sequence tag (EST) DNA database (Merck/Washington University) was searched 
with the full-length murine m-FIZZl DNA (DNA53517), and an EST, designated AA524300 was identified, 
which showed homology with the m-FIZZl DNA. 

The full-length clone corresponding to the EST AA524300 was purchased from Incyte (Incyte 
35 Pharmaceuticals, Palo Alto, CA) and sequenced in entirety. 

The entire nucleotide sequence of the resulting PRO770-encoding full-length clone is shown in 
Figure 47. This full-length clone, designated DNA54228 (SEQ ID NO: 133), contains a single open reading 
frame with an apparent translation initiation site at nucleotide positions 100-102 (Fig.47; SEQ ID NO: 133) and 
ending at the stop codon (TGA) at residues 433-435, as indicated by bolded underline. The predicted PRO770 
40 polypeptide precursor (including a putative signal sequence of 20 amino acids) [Le„ UNQ408, SEQ ID 
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NO: 134) is 1 i 1 amino acids long, has a calculated molecular weight of 1 1,730 daltons and a pi of 7.82. Based 
upon its homology to m-FIZZl (50%, using the ALIGN software), the protein is believed to be the human 
homolog of m-FIZZl, and has been designated h-FIZZl. A cDNA clone containing DNA54228 (SEQ ID 
NO: 133) has been deposited with ATCC and is assigned ATCC deposit no. 209801. 
Identification and cloning of m-FIZZl (DNA53517) 

Mouse astlxma model Female Balb/C mice, 6 to 8 weeks of age, were separated into two 
experimental groups: controls and asthmatics. The asthmatic group was immunized intraperitoneally with 10 
ug ovalbumin * I mg alum, while the control group was not. Two weeks later, mice were exposed daily to an 
aerosol of 10 mg/ml ovalbumm in PBS aerosolized with a UltraNeb nebulizer (DeVilbiss) at the rate of 2 
ml/min for 30 min each day, for 7 consecutive days. One day after the last aerosol challenge, whole blood, 
serum and bronchoalveoiar lavage (BAL) samples were collected and the lungs were harvested and preserved 
for histological examination, immuno-histochemistry and in situ hybridization. 

Gel electrophoresis of BAL *am/?teExamination of the BAL samples by gel electrophoresis on a 16% 
Tricine gel shows that a low molecular weight protein is expressed in the BAL samples from asthmatic mice . 
but not in the BAL, samples from control mice. This low molecular weight protein was termed m-FIZZl and 
was seen to co- mi grate with a 8300 Dalton marker protein. 

Partial protein sequence The protein of interest was transferred upon a PVDF membrane and 
sequenced by Edman degradation. This sequence served as a template for the preparation of various cloning 
oligos as described below. 

Partial cDNA sequence We designed two degenerate oligonucleotide PCR primers corresponding 
to the putative DNA sequence for the first 7 and the last 7 amino acids of the partial protein sequence.. 
Oligo #1: 

5'- ACA AAC GCG TG A YG A RAC NAT HGA RAT-3* (SEQ ID NO: 1 35) 

Oligo #2: 

5'-TGG TGC ATG CGG RTA RTT NGC NGG RTT-3' (SEQ ID NO: 1 36) 

cDNA prepared from the lungs of normal mice was used as a template for the PCR reaction which 

yielded an 88 bp product. This 88 bp product contained 54 known base pairs, encoding the PCR primers, and 

34 novel base pairs, and encoded another partial mFIZZ- 1 sequence. 

Full length cDNA clone This second partial sequence was used to design primers which were 

ultimately successful in obtaining the full length FIZZ clone (DNA535I7) by RT-PCR of mouse lung poiy(A)* 

RNA. 

Oligo #3: 

5'-ACA AAC GCG TGC TGG AGA ATA AGG TCA AGG-3' (SEQ ID NO: 137) 

This oligo was used as an RT-PCR primer in combination with 5' and 3' amplimers from Clontech. 
Oligo #4: 

5'-ACT AAC GCG TAG GCT AAG GAA CTT CTT GCC-3' (SEQ ID NO: 138) 

This oligo was used as an RT-PCR primer in combination with oligo d(T). 

Y. Isolation of cDNA clones Encoding Human PRQ769 (UNQ407) 
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A public expressed sequence tag (EST) DNA databases (Merck/ Washington University) was searched 
with the full-length murine m-FIZZl DNA (DNA 53517) described above and the EST W42069 was 
identified. 

TTie full-length clones corresponding to the EST fragment W42069 was obtained from Incyte 
5 Pharmaceuticals (Palo Alto, California), and sequenced in the entirety, which ultimately resulted in the 
identification of the fiili length nucleotide sequence DNA54231 (SEQ ID NO: 139). 

The nucleotide sequence corresponding to the full length, native sequence PR0769 clone is shown in 
Figure 49. This clone, designated DNA 54231 (SEQ ID NO: 139) contains a single open reading frame with an 
apparent translation initiation site at nucleotide positions 75-77 and ending at the stop codon (TGA) at residues 
10 4 1 7-4 1 9. as indicated by bolded underline (Fig. 49). The predicted PR0769 polypeptide precursor (including a 
signal sequence of 10 amino acids)(/.*.. UNQ407, SEQ ID NO:140) is 1 14 amino acids long, has a calculated 
molecular weight of 12,492 daltons and a pi of 8.19. Based on its homology to m-FIZZl (34%, using the 
ALIGN software) the protein was designated m-FIZZ3. A clone containing DNA5423I (designated 
DNA5423 1-1366) has been deposited with ATCC on April 23, 1998 and has been assigned ATCC deposit no. 
15 209802. 

Z. Isolation of cDNA clones Encoding Human PRQ788 (UN 04 30) 

Use of the ECD homology procedure identified above resulted in the identification of the partial 
length EST sequence 2777282. Further analysis of the corresponding full-length sequence resulted in the 
20 identification of DNA56405 (SEQ ID NO: 141) and the derived native sequence PR0788 protein UNQ430 
(SEQ ID NO: 142). 

Clone DNA56405 (SEQ ID NO: 141) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 84-86 and ending at the stop codon (TAG) at nucleotide 
positions 459-461 (Figure 51). as indicated by bolded underline. The predicted native sequence PR0788 

25 polypeptide precursor {i.e.. UNQ430, SEQ ID NO: 142) is 125 amino acids long (Figure 52). has a calculated 
molecular weight of 13,1 15 daltons and a pi of 5.90. The UNQ430 (SEQ ID NO: 142) protein shown in Figure 
52 has an estimated molecular weight of about 131 15 and a pi of about 5.90. A clone containing DNA56405 
(SEQ ID NO: 142) has been deposited with the ATCC on May 6, 1998 and has been assigned deposit 
number209849. In the event of a discrepancy in the nucleotide sequence of the deposit and the sequences 

30 disclosed herein, it is understood that the deposited clone contains the correct sequence. It is further 
understood that the methodology of sequencing for the sequences provided herein are based on known 
sequencing techniques. . 

Analysis of UNQ430 (SEQ ID NO:52) shown in Figure 52 reveals a signal peptide at about amino 
acids I - 1 7 and an N-glycosy lation site is at about amino acids 46. 

35 

AA. Isolation of cDNA clones Encoding Human PRQ1 1 14 (UNQ557) 

Use of the amylase yeast screen procedure described above on tissue isolated from human fetal 
kidney tissue resulted in an EST sequence which served as the template for the creation of the 
oligonucleotides below and screening as described above in a human breast carcinoma library resulted in the 
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isolation of the full length DNA sequence DNA57033 (Fig. 53, SEQ ID NO: 143) and the derived PROllH 
native sequence protein UNQ557 (Fig. 54, SEQ ID NO: 144). 

The PCR primers used in the isolation screen described in the previous paragraph were: 
forward primer (48466.fl ): 5'-AGGCTTCGCTGCGACTAGACCTC-3' (SEQ ID NO: 145) 

reverse primer: (48466.r 1 ): 5'-CCAGGTCGGGTAAGGATGGTTGAG-3' (SEQ ID NO: 146) 

hybridization probe: 48466. p 1 ) : 

S^TTTCTACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGCO 1 (SEQ ID NO: 147) 

Clone DNA57033 (SEQ ID NO: 143) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 250-252 and ending at the stop codon (TAG) found at 
nucleotide positions 1 183-1 185 (Figure 53. SEQ ID NO: 143). as indicated by bolded underline. The predicted 
PROl 1 14 polypeptide precursor (i.e.. UNQ557, SEQ ID NO: 144) is 311 amino acids long, has a calculated 
molecular weight of approximately 35,076 daltons and an estimated pi of approximately 5.04. Analysis of the 
full-length PROI 1 14 sequence shown in Figure 54 (SEQ ID NO:144) evidences the presence of the following: 
a signal peptide from about amino acid I to about amino acid 29. a transmembrane domain from about amino 
acid 230 to about amino acid 255. potcnt.al N-glycosylation sites from about amino acid 40 to about amino 
acid 43 and from about amino acid 134 to about amino acid 137. an amino acid sequence block having 
homology to tissue factor proteins from about amino acid 92 to about amino acid 119 and an amino acid 
sequence block having homology to integrin alpha cham proteins from about amino acid 232 to about amino 
acid 262. A cDNA clone containing DNA57033 (SEQ ID NO: 143) has been deposited with ATCC on May 
27, 1998 and is assigned ATCC deposit no. 209905. 

AB. Isolation of cDNA clones Encoding Human PRO 1007 (UNQ491) 

Use of the ECD homology procedure described above resulted in the identification of an EST 
sequence designated Merck EST T705I3, which was derived from human liver tissue (clone 83012 from 
library 341) was further examined. The corresponding full-length clone was further examined and sequenced, 
resulting in the isolation of the full-length DNA sequence DNA57690 (Fig. 55. SEQ ID NO: 145) and the 
derived PRO 1007 native sequence protein UNQ491 (Fig. 56. SEQ ID NO: 146). 

Clone DNA57690 (SEQ ID NO: 145) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 16-18 and ending at the stop codon (TGA) at nucleotide 
positions 1054-1056 (Figure 55), as indicated by bolded underline. The predicted PRO1007 polypeptide 
precursor (/.*.. UNQ49I, SEQ ID NO: 146) is 346 amino acids long (Figure 56), has a calculated molecular 
weight of 35,971 daltons and a pi of 8.17. The UNQ49I (SEQ ID NO: 146) protein shown in Figure 56 has an 
estimated molecular weight of about 35971 daltons and a pi of about 8.17. A cDNA clone containing 
DNA57690 (SEQ ID NO: 145) has been deposited with the ATCC on 9 June 1998, and has been assigned 
deposit number 209950. 

Analysis of the amino acid sequence of UNQ491 (SEQ ID NO: 146) reveals the putative signal peptide 
at about amino acid residues 1-30, a transmembrane domain at about amino acid residues 325-346, N- 
glycosylation sites at about amino acid residues 118, 129, 163, 176, 183 and 227 and a Ly-6/u-Par domain 
proteins at about amino acid residues 17-36 and 209-222. The corresponding nucleotides of the amino acids 
presented herein can be routinely determined given the sequences provided herein. 
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AC. Isolation of cDNA clones Encoding Human PROl 184 (UNQ598) 

Use of the signal algorithm procedure described above resulted in the identification of Incyte EST 
1428374 which was derived from an ileum tissue library (39, SINTBSTOl). Further examination of the full- 
length clone corresponding to this sequence resulted in the isolation of the full-length DNA59220 (Fig. 57, 
SEQ ID NO: 147) and the derived PROl 184 native sequence protein UNQ598 (Fig. 58, SEQ ID NO: 148). 

UNQ598 (SEQ ID NO: 148), as shown in Figure 58 exhibits an apparent translation initiation site at 
nucleotide positions 106-108 and ending at the stop codon (TGA) found at nucleotide positions 532-534, as 
indicated by bolded underline. The predicted PROl 184 polypeptide precursor (i.e., UNQ598, SEQ ID 
NO:148) is 142 amino acids long, has a calculated molecular weight of approximately 15690 daltons and an 
estimated pi of approximately 9.64. Analysis of UNQ598 (SEQ ID NO: 148) evidences the presence of a signal 
peptide at about amino acids 1-38. A cDNA clone containing DNA59220 (SEQ ID NO: 147) has been 
deposited with the ATCC on 9 June 1998, and has been assigned deposit number 209962. It is understood that 
the deposited clone has the actual sequences and that representations are presented herein. 

AD. isolation of cDNA clones Encodimi Human PRO 1 03 1 (UNQ516) 

Use of the ECD homology procedure described above resulted in the identification of the EST 
sequence Merck W74558 (clone 344649). The corresponding full-length clone was examined and sequenced 
resulting in the isolation of DNA sequencing gave the full-length DNA sequence DNA59294 (Fig. 59, SEQ ID 
NO:149) and the derived PROI03! native sequence protein UNQ5I6 (Fig. 60. SEQ ID NO: 150). 

Clone DNA59294 (SEQ ID NO: 149) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 42-44 and ending at the stop codon (TGA) at nucleotide 
positions 582-584 (Figure 59), as indicated by bolded underline. The predicted PRO1031 polypeptide 
precursor (i.e.. UNQ5I6. SEQ ID NO: 150) is 180 amino acids long (Figure 60). The UNQ516 protein shown 
in Figure 60 . has an estimated molecular weight of about 20437 and a pi of about 9.58. Clone DNA59294 
(SEO ID NO: 149) has been deposited with the ATCC on May 14, 1998 and has been assigned deposit number 
209866. Regarding the sequence, it is understood that the deposited clone contains the correct sequence, and 
the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of UNQ516 (SEQ ID NO: 150) reveals the putative signal peptide 
at about amino acid residues 1-20, an N-giycosyiatton site is at about amino acid residue 75. A region having 
sequence identity with IL-I7 is at about amino acid residues 96-180. The corresponding nucleotides can be 
routinely determined given the sequences provided herein. 

AE. Isolation of cDNA clones Encodine Human PRQ1346 (UNQ70n 

Use of the ECD homology procedure described above in a human fetal kidney library resulted in the 
isolation of the full-length DNA sequence DNA59776 (Fig. 61, SEQ ID NO:l51) and the derived PR01346 
native sequence protein UNQ701 (Fig. 62, SEQ ID NO: 152). 

The PCR primers (forward and reverse) and hybridization probe used in the isolation of DNA59776 
(SEQ ID NO:151) were the following: 

forward PCR primer (45668.fl): 5 f -CACACGTCCAACCTCAATGGGCAG-3' (SEQ ID NO: 153) 

108 



SUBSTITUTE SHEET (RULE 25) 



WO 00/53758 



PCT/US00/05841 



reverse PCR primer (45668.rl ): S'-GACCAGCAGGGCCAAGGACAAGGO' (SEQ ID NO: 1 54) 

hybridization probe (45668.p 1 ): (SEQ ID NO: 155) 

5 , -GTTCTCTGAGATGAAGATCCGGCCGGTCCGGGAGTACCGCTTAG-3 , 

Clone DNA59776 (SEQ ID NO: 151) contains a single open reading frame with an apparent 
5 translation initiation site at nucleotide positions 1-3 (ATG), and an apparent stop codon (TAG) at nucleotide 
positions 1384-1386 (TAG). The predicted PR01346 polypeptide precursor (/.<?., UNQ70I, SEQ ID NO:152) 
is 461 amino acids long. The protein contains an apparent type II transmembrane domain at amino acid 
positions from about 3 1 to about 50, fibrinogen beta and gamma chains C-terminal domain signature at about 
amino acid positions 409-421 and a leucine zipper patterns at about amino acid positions 140-161, 147-168. 
10 154-175 and 161-182. 

A cDNA clone containing DNA59776, designated as DNA59776-1600, has been deposited with 
ATCC on August 18. 1998 and is assigned ATCC deposit no. 203128. The UNQ701 (SEQ ID NO:I52) 
protein shown in Figure 62 has an estimated molecular weight of about 50744 daltons and a pi of about 6.38. 

15 AF. Isolation of cDNA clones Encoding Human PROl 155 (UNQ585) 

Use of the signal algorithm procedure described above resulted in the identification of Incyte EST 
2858870 which was derived from an ileum tissue library (39, SrNINOT03). Further examination of the full- 
length clone corresponding to this sequence resulted in the isolation of the full-length DNA sequence 
DNA59849 (Fig. 63, SEQ ID NO: 156) and the derived PROl 155 native sequence protein UNQ585 (Fig. 64, 

20 SEQ ID NO: 157). 

The UNQ585 (SEQ ID NO: 157) polypeptide shown in Figure 64 contains a single open reading frame 
with an apparent translation initiation site at nucleotide positions 158-160 and ending at the stop codon (TAA) 
found at nucleotide positions 563-565. as indicated by bolded underline. The predicted PROl 155 polypeptide 
precursor {i.e.. UNQ585. SEQ ID NO: 157) is 135 amino acids long, and signal peptide appears at about amino 
25 acids residues 1 to about 18, a leucine zipper pattern appears at about amino acid residues 43 to 64 and a 
tachykinin family signature appears at about amino acid residues 86 to about 91. UNQ585 (SEQ ID NO:I57) 
has a calculated molecular weight of approximately 14833 daltons and an estimated pi of approximately 9.78. 
A cDNA clone containing DNA59849 (SEQ ID NO: 156), designated as DNA59849-1504. has been deposited 
with ATCC on June 16, 1998 and is assigned ATCC deposit no. 209986. 

30 * 

AG. Isolation of cDNA clones Encoding Human PRO 1250 (UNQ633) 

Use of the signal algorithm procedure described above resulted in the identification of an EST cluster 
sequence from the Incyte database, designated Incyte EST cluster sequence no. 56523. This sequence was then 
compared to a variety of various EST databases as described under the signal algorithm procedure above, and 
35 further resulted in the identification of Incyte EST 3371784. Further examination and sequencing of the full- 
length clone corresponding to this EST sequence resulted in the isolation of the full-length DNA sequence 
DNA60775 (Fig. 65, SEQ ED NO: 158) and the derived PRO 1250 native sequence protein UNQ633 (Fig. 66, 
SEQ ID NO: 159). 

Clone DNA60775 (SEQ ID NO: 158) contains a single open reading frame with an apparent 
40 translation initiation site at nucleotide positions 74-76 and ending at the stop codon (TAG) at nucleotide 
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positions 2291-2293 (Figure 65). The predicted PRO1250 polypeptide precursor (/.<?., UNQ633, SEQ ID 
NO:159) is 739 amino acids long (Figure 66). The UNQ633 (SEQ ID NO:159) protein shown in Figure 66 has 
an estimated molecular weight of about 82,263 daltons and a pi of about 7.55. Analysis of UNQ633 (SEQ ID 
NO: 159) evidences the presence of the following: a type II transmembrane domain from about amino acid 
residues 61 to about 80, a putative AMP-binding domain signature sequence from about amino acid residue 
314 to about 325. and potential N-glycosylation sites from about amino acid residues 102 to about 105, from 
about amino acid residues 588 to about 591 and from about amino acid residues 619 to about 622. A cDNA 
clone containing DNA60775 (SEQ ID NO: 158) has been deposited with the ATCC on September 1, 1998 and 
is assigned ATCC deposit no. 203 173. 

AH. Isolation of cDNA clones Encoding Human PRQ1312 (UNQ678) 

An EST (DNA55773) was identified in a human fetai kidney cDNA library using a yeast screen, that 
preferentially represents the 5' ends of the primary cDNA clones. Based on the DNA55773 sequence, 
oligonucleotides were synthesized for use as probes to isolate the full-length DNA sequence DNA61873 (Fig. 
67. SEQ ID NO: 160) and the derived PRO 13 12 native sequence UNQ678 (SEQ ID NO: 161). 

The full length DNA61873 clone shown in Figures 67 (SEQ ID NO: 1 60) contains a single open 
reading frame with an apparent translation initiation site at about nucleotide positions 7-9 and ending at the 
stop codon (TGA) found at about nucleotide positions 643-645, as indicated by bolded underline. The 
predicted PRO 13 12 polypeptide precursor (i.e., UNQ678, SEQ ID NO: 161 ) is 212 amino acids long. UNQ678 
(SEQ ID NO: 1 61) has a calculated molecular weight of approximately 24,024 daltons and an estimated pi of 
approximately 6.26. Other features include a signal peptide at about amino acids 1-14: a transmembrane 
domain at about amino acids 141-160, and potential N-glycosylation sites at about amino acids 76-79 and 93- 
96. A clone containing DNA61873 (SEQ ID NO:160) has been deposited with the ATCC on August 18. 1998, 
under the designation DNA61 873- 13 12, and has been assigned deposit number 203132. 

AI. Isolation of cDNA clones Encoding Human PROl 192 (UNO606) 

Use of the ECD homology procedure described above in a human fetal liver library resulted in the 
isolation of the full-length DNA sequence DNA62814 (Fig. 69, SEQ ID NO:162) and the derived PROl 192 
native sequence protein UNQ606 (Fig. 70, SEQ ID NO: 163). 

The PCR primers (forward and reverse) and hybridization probe used in the isolation of DNA62814 
(SEQ ID NO: 162) were the following: 

forward PCR primer (35924.fl): 5'-CCGAGGCCATCTAGAGGCCAGAGC-3' (SEQ ID NO:164), 
reverse PCR primer (35924.rl): 5'-ACAGGCAGAGCCAATGGCCAGAGC-3' (SEQ ID NO: 165). 
hybridization probe (35924.p 1 ): ( SE q ^ NQ . { 66) 

5 t -GAGAGGACTGCGGGAGTTTGGGACCTTTGTGCAGACGTGCTCATG-3 , 

Clone DNA62814 (Fig. 69, SEQ ID NO: 162) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 121-123, and an apparent stop codon (TAA) at nucleotide 
positions 766-768, as indicated by bolded underline. The predicted PROl 192 polypeptide precursor (Le., 
UNQ606, SEQ ID NO: 163) is 215 amino acids long. The UNQ606 (SEQ ID NO: 1 63) polypeptide precursor 
shown in Figure 70 has a signal peptide at about amino acids 1-21; a transmembrane domain at about amino 
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acids 153-176; potential N-glycosyiation sites at about amino acids 39-42 and 118-121; and homology with 
myelin P0 proteins at about amino acids 27-68 and 99-128. The UNQ606 (SEQ ID NO: 163) shown in Figure 
70 has an estimated molecular weight of about 24,484 Daltons and a pi of about 6.98. 

A cDNA clone containing DNA62814 (SEQ ID NO:162), designated as DNA62814-1521, was 
5 deposited with the ATCC on August 4, 1998, and is assigned ATCC deposit no. 203093. 

AJ. Isolation of cDNA clones Encoding Human PRO 1246 (UNQ630^ 

Use of the signal algorithm procedure described above resulted in the identification of an EST cluster 
sequence from the Incyte database, designated Incyte EST cluster sequence no. 56853. This sequence was then 
10 compared to a variety of various EST databases as described under the signal algorithm procedure above, and 
further resulted in the identification of Incyte EST 2481345. Further examination and sequencing of the full- 
length clone corresponding to this EST sequence resulted in the isolation of the full-length DNA sequence 
DNA64885 (Fig. 71, SEQ ID NO: 167) and the derived PR01246 native sequence protein UNQ630 (Fig. 72, 
SEQ ID NO: 168). 

15 Clone DNA64885 (SEQ ID NO: 167) contains a single open reading frame with an apparent 

translation initiation site at nucleotide positions 119-121 and ending at the stop codon (TGA) at nucleotide 
positions 1727-1729 (Figure 71). as indicated by bolded underline. The predicted PRO 1 246 polypeptide 
precursor {U>.. UNQ630. SEQ ID NO: 168) is 536 amino acids long (Figure 72). has an estimated molecular 
weight of about 61,450 daltons and a pi of about 9.17. Analysis of UNQ630 (Fig. 72, SEQ ID NO:168) reveals 

20 the following: a signal peptide from about amino acid 1 to about amino acid 15. potential N-giycosylation sites 
from about amino acid 108 to about amino acid III, from about amino acid 166 to about amino acid 169, from 
about amino acfd 193 to about amino acid 196, from about amino acid 262 to about amino acid 265, from 
about amino acid 375 to about amino acid 378. from about amino acid 413 to about amino acid 416 and from 
about amino acid 498 to about amino acid 501 and amino acid sequence blocks having homology to sulfatase 

25 proteins from about amino acid 286 to about ammo acid 315. from about amino acid 359 to about ammo acid 
369 and from about amino acid 78 to about amino acid 97. A cDNA containing DNA64885 (SEQ ID 
NO: 167). designated DNA64885-1529. has been deposited with ATCC on November 3, 1998 and is assigned 
ATCC deposit no. 203457. 

30 AK. Isolation of cDNA clones Encodine Human PRQ1283 (UNQ653) 

Use of the ECD homology procedure described above in a human breast tumor tissue library resulted 
in the isolation of the full-length DNA sequence DNA65404 (Fig. 73, SEQ ID NO: 169) and the derived 
PRO 1 283 native sequence protein UNQ653 (Fig. 74, SEQ ID NO: 1 70). 

The PCR primers (forward and reverse) and hybridization probes used in the isolation of DNA65404 

35 (SEQ ID NO: 169) were the following: 

forward PCR primer (28753.fl): 5 , -GGAGATGAAGACCCTGTTCCTG-3' (SEQ ID NO: 171) 

forward PCR primer (28753.fi 1): S'-GGAGATGAAGACCCTGTTCCTGGGTG-S' (SEQ ID NO: 172) 
reverse PCR primer (28753^1): S'-GTCCTCCGGAAAGTCCTTATCO' (SEQ ED NO: 173) 

reverse PCR primer (28753.rl 1): 5-GCCTAGTGTTCGGGAACGCAGCTTC-3' (SEQ ED NO: 174) 

40 hybridization probe (28753.pl): ( SE q rjj NO:I75) 
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5'-CAGGGACCTGGTACGTGAAGGCCATGGTGGTCGATAAGGACTTTCCGGAG-3' 

hybridization probe (28753.pl 1): (SEQ IDN0 :176) 

5'-CTGTCCTTCACCCTGGAGGAGGAGGATATCACAGGGACCTGGTAC-3' 

Clone DNA65404 (SEQ ID NO: 169) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 45-47 and ending at the stop codon (TAG) at nucleotide 
positions 555-557 (Figure 73), as indicated by bolded underline. The predicted PR01283 polypeptide 
precursor (i.e., UNQ653, SEQ ID NO:170) is 170 amino acids long (Figure 74). The UNQ653 (SEQ ID 
NO:170) protein shown in Figure 74 has an estimated molecular weight of about 19.457 daltons and a pi of 
about 9.10. Analysis of the UNQ653 (SEQ ID NO:170) ev.dences the presence of the following: a signal 
peptide from about amino acid 1 to about amino acid 17. A cDNA clone containing DNA65404 (SEQ ID 
NO:I69), designated DNA65404-1551. has been deposited with ATCC on September 9. 1998 and is assigned 
ATCC deposit no. 203244. 

AL. Isolation of cDNA clones Encoding Human PRO 11 95 (UNQ608) 

Use of the signal algorithm procedure described above resulted in the identification of an EST cluster 
sequence 32204 from the Incyte database. Tins sequence was then compared to a variety of various EST 
databases as described under the signal algorithm procedure above, and further resulted in the identification of 
Incyte EST352980. Further examination and sequencing of the full-length clone corresponding to this EST 
sequence resulted in the isolation of the full-length DNA sequence DNA65412 (Fig. 75, SEQ ID NO: 177) and 
the derived PROl 195 native sequence protein UNQ608 (Fig. 76, SEQ ID NO: 178). 

The foil length clone DNA65412 (SEQ ID NO:177) contains a single open reading frame with an 
apparent translation initiation site at nucleotide positions 58-60 and ending at the stop codon (TAG) found at 
nucleotide positions 511-513 (Figure 75). as indicate by bolded underline. The predicted PROII95 
polypeptide precursor {La.. UNQ608, Figure 76. SEQ ID NO: 178) is 151 amino acids long, has a calculated 
molecular weight of 17.227 daltons and a pi of 5.33. Analysis of UNQ608 (SEQ ID NO: 1 78) reveals a signal 
sequence at about amino acids 1-22. a calculated molecular weight of approximately 17277 daltons and an 
estimated pi of approximately 5.33. A cDNA clone containing DNA654I2 (SEQ ID NO:177), designated as 
DNA65412-I523, was deposited with the ATCC on August 4, 1998 and is assigned ATCC deposu no. 203094. 

AM. Isolation of cDNA clones Encoding Human PRO 1343 (UNQ698) 

Use of the amylase yeast screen procedure described above on tissue isolated from human smooth 
muscle cell tissue resulted in an EST sequence which served as the template for the creation of the 
oligonucleotides below and screening as described above in a human smooth muscle cell tissue library 
resulted in the isolation of the &II length DNA sequence DNA66675 (Fig. 77, SEQ ID NO: 179) and the 
derived PR01343 native sequence protein UNQ698 (Fig. 78, SEQ ID NO: 180). 

The oligonucleotide probes employed were as follows: 
forward PCR primer (4892l.fi) 5'-CAATATGCATCTTGCACGTCTGG-3' (SEQ ID NO:181) 
reverse PCR primer (48921.rl) 5 , -AAGCTTCTCTGCrTCCTTTCCTGC-3' (SEQ ID NO: 182) 
hybridization probe (4892 I.dH 

5'-TGACCCCATTGAGAAGGTCATTGAAGGGATCAACCGAGGGCTG-3' (SEQ ID NO: 183) 
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The full length clone DNA66675 (SEQ ID NO:179) contains a single open reading frame with an 
apparent translation initiation site at nucleotide positions 71-73. and a stop signal (TAA) at nucleotide positions 
812-814 (Figure 77), as indicated by bolded underline. The predicted PR01343 polypeptide precursor (Le.. 
UNQ698, SEQ ID NO:180, Fig. 78) is 247 amino acids long, has a calculated molecular weight of 
approximately 25,335 daltons and an estimated pi of approximately 7.0. Analysis of the UNQ698 sequence 
shown in Figure 78 (SEQ ID NO:I80) evidences the presence of the following: a signal peptide from about, 
amino acid I to about amino acid 25 and a homologous region to circumsporozoite repeats from about amino 
acid 35 to about amino acid 225. A cDNA clone containing DNA66675 (SEQ ID NO:I79), designated 
DNA66675-1587, has been deposited with ATCC on September 22, 1998 and is assigned ATCC deposit no 
203282. 

Alternatively, a comparison of the yeast EST sequence isolated from the amylase screen above was 
screened against various EST databases, both public and private (e.g.. see ECD homology procedure, above) 
resulting in the identification of Incyte EST clone no. 4701148. Further analysis and sequencing of the 
corresponding full-length clone resulted in isolation of the DNA66675 sequence (SEQ ID NO: 179) shown in 
Figure 77. 

AN - Isolation of cDNA clones Encoding Human PROI418 (UNQ732) 

Use of the signal algorithm procedure described above resulted in the identification of an EST cluster 
sequence 10698 (Incyte cluster 121480). This sequence was then compared to a variety of various EST 
databases (including those denved from a placenta tissue library) as described under the signal algorithm 
procedure above, and further resulted in the identification of Incyte EST1306026. Further examination and 
sequencing of the full-length clone corresponding to this EST sequence resulted in the isolation of the full- 
length DNA sequence DNA68864 (Fig. 79, SEQ ID NO:184) and the denved PR01418 native sequence 
protein UNQ732 (Fig. 80, SEQ ID NO: 1 85). 

The full length clone shown in Figure 79 (DNA68864, SEQ ID NO: 184) contains a single open 
reading frame with an apparent translation initiation site at nucleotide positions 138-140 and ending at the stop 
codon (TAA) found at nucleotide positions 1 188-1190. as indicated by bolded underline. The predicted 
PROI418 polypeptide precursor (i.e.. UNQ732, SEQ ID NO:185) is 350 amino acids long with a signal' 
peptide at about amino acids 1-19, a calculated molecular weight of approximately 39003 daltons and an 
estimated pi of approxhnately 5.59. A cDNA clone containing DNA68864 (SEQ ID NO: 184), designated as 
DNA68864-I629 was deposited with the ATCC on September 22, 1998 and is assigned ATCC deposit no. 
203276. 



AO- Isolation of c DNA clones Encoding Human PRO 1387 (UNQ722) 

Use of the signal algorithm procedure described above resulted in the identification of an EST cluster 
sequence 10298. This sequence was then compared to a variety of various EST databases as described under 
the signal algorithm procedure above, and further resulted in the identification of Incyte EST3507924. Further 
examination and sequencing of the full-length clone corresponding to this EST sequence resulted in the 
isolation of the full-length DNA sequence DNA68872 (Fig. 81, SEQ ID NO:186) and the derived PR01387 
native sequence protein UNQ722 (Fig. 82, SEQ ID NO: 1 87). 
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Clone DNA68872 (SEQ ID NO: 1 86) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 76-78 and ending at the stop codon (TGA) at nucleotide 
positions 1258-1260 (Figure 81). as indicated by bolded underline. The predicted PR01387 polypeptide 
precursor (i.e.. UNQ722, SEQ ID NO: 187) is 394 amino acids long. The UNQ722 (SEQ ID NO:187) protein 
shown in Figure 82 has an estimated molecular weight of about 44,339 daltons and a pi of about 7.10. 
UNQ722 (SEQ ID NO: 1 87) further contains a signal peptide from about amino acid residues I to about residue 
19, a transmembrane domain from about residue 275 to about residue 296. potential N-glycosylation sites at 
about residues 76, 231, 302. 307 and 376 and amino acid sequence blocks having homology to myelin pO 
protein from about amino acid residue 210 to about residue 239 and from about amino acid residue 92 to about 
residue 121. A cDNA clone containing DNA68872. designated as DNA68872-1620. has been deposited with 
the ATCC on August 25, 1 998 and is assigned ATCC deposit no. 203 1 60. 

AP . Isolation of cDNA clones Encoding Human PRQ141 0 (UNQ728) 

Use of the signal algorithm procedure described above resulted in the identification of an EST cluster 
sequence 98502. This sequence was then compared to a variety of various EST databases as described under 
the signal algorithm procedure above, and further resulted in the identification of Incyte EST1 257046. Further 
examination and sequencing of the full-length clone corresponding to this EST sequence resulted in the 
isolation of the full-length DNA sequence DNA68874 (Fig. 83. SEQ ID NO: 188) and the derived PR01387 
native sequence protein UNQ728 (Fig. 84. SEQ ID NO: 189). 

Clone DNA68874 (SEQ ID NO: 188) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 152-154 and ending at the stop codon (TGA) at nucleotide 
positions 866-868 (Figure 83). as indicated by bolded underline. The predicted PRO1410 polypeptide 
precursor (i.e.. UNQ728, SEQ ID NO: 189) is 238 amino acids long (Figure 84). The UNQ728 protein (SEQ 
ID NO: 189) shown in Figure 84 has an estimated molecular weight of about 25.262 daltons and a pi of about 
6.44. a signal peptide from about amino acid residue 1 to about residue 20. a transmembrane domain from 
about ammo acid residue 194 to about residue 220 and a potential N-glycosylation site at about amino acid 
residue 132. A clone containing DNA68874 (SEQ ID NO: 188) has been deposited with ATCC on September 
22. 1998 and is assigned ATCC deposit no. 203277. 

AQ. Isolation of cDNA clones Encoding Human PRO 191 7 (UNQ900) 

Use of the signal algorithm procedure described above resulted in the identification of an EST cluster 
sequence 85496. This sequence was then compared to a variety of various EST databases as described under 
the signal algorithm procedure above, and further resulted in the identification of Incyte EST3255033. This 
EST was derived from an ovarian tumor library. Further examination and sequencing of the full-length clone 
corresponding to this EST sequence resulted in the isolation of the full-length DNA sequence DNA76400 (Fig. 
85, SEQ ID NO:190) and the derived PR01917 native sequence protein UNQ900 (Fig. 86. SEQ ID NO:I9I). 

The full length clone DNA76400 (SEQ ID NO: 190) shown in Figure 85 contains a single open 
reading frame with an apparent translation initiation site at nucleotide positions 6 to 9 and ending at the stop 
codon (TGA) found at nucleotide positions 1467 to 1469 as indicated by bolded underline. The predicted 
PR01917 polypeptide precursor («.<>.. UNQ900, SEQ ID NO:191) is 487 amino acids long. UNQ900 (SEQ ID 
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NO: 191) has a calculated molecular weight of approximately 55,051 daltons and ah estimated pi of 
approximately 8.14, Additional features include: a signal peptide at about amino acid residues 1-30; potential 
N-glycosylation sites at about amino acid residues 242 and 481, protein kinase C phosphorylation sites at about 
amino acid residues 95-97, 182-184, and 427-429; N-myristoylation sites at about amino acid residues 107- 
5 112, 113-118, 117-122, 118-123, and 128-133; and an endoplasmic reticulum targeting sequence at about 
amino acid residues 484-487. 

AR. Isolation of cDNA clones Encoding Human PRQ1868 (UNQ859) 

Use of the ECD homology procedure described above in a human fetal liver library resulted in the 

10 identification of EST clone no. 2994689. Further analysis and sequencing of the corresponding full-length 
clone resulted in the isolation of DNA77624 (Fig. 87, SEQ ID NO: 192) and the derived PRO 1868 native 
sequence protein UNQ859 (Fig. 88, SEQ ID NO: 193). 

Clone DNA77624 (Fig. 88, SEQ ID NO: 193) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 51-53 and ending at the stop codon (TGA) at nucleotide 

15 positions 981-983. as indicated by bolded underline. The predicted PROI868 polypeptide precursor (i.e.. 
UNQ859. SEQ ID NO: 193. Fig. 89) is 310 amino acids long. The UNQ859 {SEQ ID NO: 193) protein shown 
in Figure 89 has an estimated molecular weight of about 35.020 daltons and a pi of about 7.90, a 
transmembrane domain from about amino acid residue 243 to about residue 263. potential N-glycosylation 
sites at about amino acid residues 104 and 192. a cAMP- and cGMP-dependent protein kinase phosphorylation 

20 site from about amino acid residues 107 to about residue 110. casein kinase II phosphorylation sites from about 
amino acid residues 106 to about residue 109 and from about amino acid residue 296 to about residue 299, a 
tyrosine kinase phosphorylation site from about amino acid residue 69 to about residue 77 and potential N- 
myristolation sites from about amino acid residue 26 to about residue 31. from about residue 215 to about 
residue 220. from about residue 226 to about residue 231. from about residue 243 to about residue 248. from 

25 about residue 244 to about residue 249 and from about residue 262 to about residue 267. A cDNA clone 
containing DNA77624 (SEQ ID NO: 193) has been deposited with ATCC on December 22. 1998 and is 
assigned ATCC deposit no 203553. 

30 AS. Isolation of cDNA clones Encoding Human PRO205 (UNO 179) 

Use of the ECD procedure above resulted in the identification of an EST sequence derived from a 
human retinal library. Additional effort to identify the full length clone using an in vitro cloning procedure 
were unable to identify another PRO205 encoding DNA sequence. 

DNA sequence encoding other polypeptide of substantial homology to the UNQ179 (SEQ ID 
35 NO:229) polypeptide of Figure 90 may be found as GenBank submissions AB033089J and HSM802 147 J . 

Clone DNA30868 (SEQ ID NO:89) contains what is believed to be an incomplete open reading frame 
with an apparent translation initiation site at nucleotide positions 405-407 as indicated by bolded underline in 
Figure 89. The predicted partial length PR01868 polypeptide precursor (/.*., UNQ179, SEQ ID NO:229) is 
343 amino acids long, has a calculated molecular weight of 39285 daltons and a pi of 6.06. 
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Analysis of the UNQ179 (SEQ ID NO:229) shown in Figure 90 reveals a signal peptide at about 
amino acid residues 1 to 20, an N-glycosyiation site at about amino acid residues 318-322, tyrosine kinase 
phosphorylation sites at about amino acids residues 21-29 and 21 1-220, N-myristolation sites at about residues 
63-69, 83-89 and 317-323 and a prokaryotic membrane lipoprotein lipid attachment site at about residues 260- 
5 271. A cDNA clone containing DNA30868 (SEQ ID NO:228) has been deposited with the ATCC on March 2, 
2000 under the designation DNA30868-1 156 and has been assigned ATCC deposit no. . 

AT. Isolation of cDNA clones Encoding Murine PRQ2I (UNQ21) 

The isolation of DNA36638 (Fig. 91, SEQ ID NO:230), which encodes the native sequence PR02I 
10 polypeptide UNQ21 (Fig. 92, SEQ ID NO:231) has been previously described in U.S.P. 5,955,420. Additional 
cloning and characterizing information can be found in Schneider et aL Cell 54 (6): 787-93 (1988) and in 
Manfioletti et ai, Mol Cell Biol \2 (8): 4976-85 (1993). , 

Clone DNA36638 contains a single open reading frame with an apparent translation initiation site at 
nucleotide residues 168-170 and ending at the stop codon (TAG) at nucleotide residues 2187-2189 (Figure 91), 
15 as indicated by boldcd underline. The predicted PR021 polypeptide precursor He.. UNQ21, SEQ ID NO:231) 
is 673 amino acids long, has a calculated molecular weight of 74,512 daltons and a pi of 5.45. A cDNA clone 
containing DNA36638 has been deposited with the ATCC under the designation DNA36638-I056 on 
November 12, 1997 and has been assigned ATCC deposit number 209456. 

Analysis of the UNQ21 polypeptide of Figure 92 (SEQ ID NO:23 1) reveals a signal sequence at about 
20 amino acid residues 1-27, a transmembrane domain at about amino acid residues 619-635, N-glycosyiation 
sites at about residues 417-421 and 488-492, N-myristolation sites at about amino acid residues 126-132, 135- 
141, 146-152, 173-179. 214-220, 253-259, 346-352, 374-380, 440-446. 479-485, 497-503, 517-523, 612-618, 
aspanic acid and asparagine hydroxylation sites at about amino acid residues 130-142. 168-180, 209-221 and 
248-260, a vitamin K-dependent carboxylation domain and an EGF-Iike domain cysteine pattern signature at 
25 about amino acid residues 139-151. 



AU. Isolation of cDNA clones Encoding Human PRQ269 (UNQ236) 

Use of the ECD homology procedure described above in a human fetal kidney library in combination 

with an in vitro cloning procedure using the probe oligonucleotide and one of the primer pairs below resulted 
30 in the identification of the full length DNA sequence DNA38260 (Fig. 93, SEQ ID NO:232) and the derived 

PR0269 native sequence protein UNQ236 (Fig. 94, SEQ ID NO:233). 

The forward and reverse PCR primers and the hybridization probe used were the following: 

forward PCR primer (.fl): (SEQ ID NO: 234) 

5 t -TGGAAGGAGATGCGATGCCACCTG -3' 
35 forward PCR primer (.£2): (SEQ ID NO:235) 

5'-TGACCAGTGGGGAAGGACAG-3' 

forward PCR primer (.0): (SEQ ID NO:236) 

5VACAGAGCAGAGGGTGCCTTG-3' 

reverse PCR primer (.rl ): (SEQ ID NO:237) 

40 SVTCAGGGACAAGTGGTGTCTCTCCC-T 
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reverse PCR primer( .r2): (SEQ ID NO:238), 

5 , -TCAGGGAAGGAGTGTGCAGTTCTG-3 , 
hybridization probe : (SEQ ID NO:239) 

5 , -ACAGCTCCCGATCTCAGTTACTTGCATCGCGGACGAAATCGGCGCTCGCT-3 , 
5 Clone DNA38260 (SEQ ID NO:232) contains a single open reading frame with an apparent 

translation initiation site at nucleotide positions 314-316 and ending at the stop codon (TAG) at nucleotide 
positions 1784-1786 (Fig. 93), as indicated by bolded underline. The predicted PR0269 polypeptide precursor 
is 490 amino acids long (/.<?., UNQ236, Fig. 94, SEQ ID NO:233), has a calculated molecular weight of 51,636 
daltons and a pi of 6.29. A cDNA clone containing DN A3 8260 (SEQ ID NO:232) has been deposited with 

10 ATCC on October 17, 1997 and is assigned ATCC deposit no. 209397. 

Analysis of the UNQ236 polypeptide of Figure 94 (SEQ ID NO:223) reveals a signal sequence at 
about amino acid residues 1-16, a transmembrane domain at about residues 399-418, N-glycosylation sites at 
about amino acid residues 189-193 and 381-385, a glycosaminoglycan attachment site at about amino acid 
residues 289-293, cAMP- and cGMP-dependent protein kinase phosphorylation sites at about amino acid 

15 residues 98-102 and 434-438. N-myristolation sites about amino acid residues 30-36, 35-41. 58-64. 59-65, 121- 
127, 151-157. 185-191. 209-215. 267-273. 350-356. 374-380. 453-459, 463-469 and 477-483 and an aspartic 
acid and asparagine hydroxylation site at about amino acid residues 262-274. 



AV. Isolation of cDNA Encoding Human PRQ344 (UNQ303) 
20 Use of the ECD homology procedure described above in a human fetal kidney library in combination 

with an in vitro cloning procedure using the probe oligonucleotide and one of the pnmer pairs, below resulted 

in the identification of the full length DNA sequence DNA40592 (Fig. 95, SEQ ID NO:240) and the derived 

PR0344 native sequence protein UNQ303 (Fig. 96, SEQ ID NO:241). 

The forward and reverse PCR primers and the hybridization probe used were the following: 
25 forward PCR primer (34398. fl): (SEQ ID NO:242) 

5 , -TACAGGCCCAGTCAGGACCAGGGG-3* 

forward PCR primer (34398.f2): (SEQ ID NO:243) 

5 , -AGCCAGCCTCGCTCTCGG-3' 

forward PCR primer (34398.0): (SEQ ID NO:244) 

30 5 , -GTCTGCGATCAGGTCTGG-3* 

reverse PCR primer (34398.r I ): (SEQ ID NO:245) 

5'-GAAAGAGGCAATGGATTCGC-3* 

reverse PCR primer (34398.r2): (SEQ ID NO:246) 

5'-GACTTACACTTGCCAGCACAGCAC-3' 
35 - hybridization probe (34398.pl): (SEQ ID NO:247) 

5-GGAGCACCACCAACTGGAGGGTCCGGAGTAGCGAGCGCCCCGAAG-3' 

Clone DNA40592 (SEQ ID NO:240) contains a single open reading frame with an apparent 

translation initiation site at nucleotide positions 227-229 and ending at the stop codon (TAG) at nucleotide 

positions 956-958 (Figure 95). The predicted PR0344 polypeptide precursor (i.e.. UNQ303, SEQ ID NO:241) 
40 is 243 amino acids long (Figure 96), has a calculated molecular weight of 25,298 daltons and a pi of 6.44. 
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Analysis of the UNQ303 polypeptide of Figure 96 (SEQ ID NO:24!) reveals a signal peptide at about amino 
acid residue 1-15, N-myristoIation sites at about amino acid residues 11-17. 68-74, and 216-222 and a cell 
attachment site at about amino acid residues 77-80. A cDNA clone containing DNA40592 (SEQ ID NO:240) 
has been deposited with ATCC on November 21, 1997 and is assigned ATCC deposit no. 209492. 

5 

AX. Isolation of cDNA clones Encoding Human PRQ333 f UNQ294) 

Use of the ECD homology procedure in combination with an in vivo cloning procedure resulted in the 
identification of the partial length sequence DNA41374 (SEQ ID NO:248, Figure 97). 

Clone DNA41374 (SEQ ID NO:248) contains an incomplete open reading frame with an apparent 
10 translation termination site (i.e.. stop codon, TGA) at nucleotide residues 1185-1187. as indicated in bolded 
underline. The predicted partial length PR0333 polypeptide (i.e., UNQ294, SEQ ID NO:249) is 394 amino 
acids long, a calculate molecular weight of 43,725 daltons and a pi of 8.36. 

Analysis of the UNQ294 (SEQ ID NO:249) polypeptide of Figure 98 reveals a signal sequence at 
about amino acid residues 1-14, a transmembrane domain at about residues 359-376, N-myristoylation sites at 
15 about amino acid residues 166-172. 206-212. 217-223. 246-252, 308-314. 312-318. 361-367 and an 
immunoglobulin and major histocompatibility complex proteins signature at amino acid residues 315-323. A 

cDNA clone containing DNA41374 has been deposited with the ATCC on and as assigned 

ATCC deposit number . 

20 AY. Isolation of cDNA clones Encoding Human PRQ381 (UNQ322) 

Use of the ECD homology procedure described above in a human fetal kidney library resulted in the 
identification of the mil length DNA sequence DNA44I94 (Fig. 99, SEQ ID NO:250) and the derived PR0381 
native sequence protein UNQ322 (Fig. 100. SEQ ID NO:251). 

The forward and reverse PCR primers and the hybridization probe used were the following: 
25 Forward PCR primer (3965 1 . fl ): (SEQ ID NO:252) 

5' -CTTTCCTTGCTTCAGCAACATG AG GC -3' 

Reverse PCR primer (3965 1 .r 1 ): - - (SEQ ID NO:253) 

5 , -GCCCAGAGCAGGAGGAATGATGAGC-3 , 

hybridization probe (3965 1 .pi): (SEQ ID NO:254) 

30 5'-GTGGAACGCGGTCTTGACTCTGTTCGTCACTrCTT^ 

Clone DNA44194 (SEQ ID NO:250) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 174-176 and ending at the stop codon (TAG) at nucleotide 
positions 807-809 (Fig. 99), as indicated by bolded underline. The predicted PR038 1 polypeptide precursor 
(le.. UNQ322, Fig. 100, SEQ ID NO:251) is 211 amino acids long, has a calculated molecular weight of 

35 24,172 daltons and has a pi of 5.99. The UNQ322 (SEQ ID NO:25l) protein shown in Figure 100 has the 
following features: a signal peptide from about amino acid residues 1 to about 20, a potential N-glycosylation 
site at about amino acid residue 156, potential casein kinase phosphorylation sites from about amino acid 
residues 143 to about 146, about residues 156 to about 159, about residues 178 to about 181, about residues 
200 to about 203, an endoplasmic reticulum targeting sequence from about amino acid residues 78 to about 1 14 

40 and from about residues 1 18 to about 131, EF-hand calcium binding domain from about amino acid residues 
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140 to about 159, and an S-100/ICaBP type calcium binding domain from about amino acid residues 183 to 
about 203. A cDNA clone containing DNA44194 (SEQ ID NO:250) has been deposited with the ATCC on 
April 28. 1998 and is assigned deposit number 209808. 

AZ. Isolation of cDNA clones Encoding Murine PRO720 (UNQ388) 

The preparation of DNA53517 (SEQ ID NO:255) is described above under "X. Isolation of cDNA 
clones Encoding Human PRO7 70 (UNO408) ." Clone QNA53517 (SEQ ID NO:255) contains a single open 
reading frame with an apparent translation initiation site at nucleotide residues 36-38 and ending at the stop 
codon (TAA) at 369-371 (Figure 101), as indicated by bolded underline. The predicted PRO720 polypeptide 
precursor (i.e.. UNQ388, SEQ ID NO:256) is 1 11 amino acids long (Figure 102), has a calculated molecular 
weight of 1 1,936 daltons and a pi of 5.2 1 . 

Analysis of the UNQ388 (SEQ ID NO:256) polypeptide of Figure 102 reveals a signal sequence at 
about amino acid residues I -23, N-myristolation sites at about ammo acids residues 70-76 and 75-81 and 
prokaryotic membrane lipoprotein lipid attachment sites at 66-77 and 68-79. A cDNA clone containing 
DNA53517 (SEQ ID NO:255) has been deposited with the ATCC on April 23. 1998 and is assigned deposit 
number 209802. 

BA. Isolation of cDNA clones Encoding Human PRQ866 (UNQ435) 

Use of the ECD homology procedure described above in a human fetal kidney library resulted in the 
identification of the full length DNA sequence DNA53971 (Fig. 103, SEQ ID NO:257) and the derived 
« imiuvv. bv.qui-uwe^ruicai ui><v4_>5 trig, ickk ^Q ijj ^V:25H). 
The forward and reverse PCR primers and the hybridization probe used were the following: 
Forward PCR primer (44708.fl): (SEQ ID NO:259) 

5'-CAGCACTGCCAGGGGAAGAGGG-3* 

Forward PCR primer (44708.O): (SE q id NO: 260) 

5'-CAGGACTCGCTACGTCCG-3* 

Forward PCR primer (44708. D): (SEQ ID NO:261) 

S'-CAGCCCCTTCrCCTCCTTTCTCCCO* 

Reverse PCR primer (44708.rl): (SE q id NO:262) 

5'-GCAGTTATCAGGGACGCACTCAGCC-3 , 

Reverse PCR primer (44706.r2): (SE q id NO:263) 

5 , -CCAGCGAGAGGCAGATAG-3 1 

Reverse PCR primer (44706.r3): - (S E q (£> NO:264) 

5 , -CGGTCACCGTGTCCTGCGGGATG-3 t 

hybridization probe (44708.p 1 ): (SE q id NO: 265) 

5 , ^AGCCCCTrCTCCTCCTTTCTCCCACGTCCTATCrGCCTCTC-3' 

The clone DNA53971 (SEQ ID NO:257) contains a single open reading frame with an apparent 
translation^ initiation site at nucleotide positions 275-277 and ending at the stop codon (TAA) at nucleotide 
positions 1268-1270 (Figure 103), as indicated by bolded underline. The predicted native sequence PRO866 
polypeptide precursor (U. UNQ435, SEQ ID NO:258) is 331 amino acids (Figure 104), has a calculated 
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molecular weight of 35,844 daltons and a pi of 5.45. The UNQ435 (SEQ ID NO:258) protein shown in figure 
104 has an estimated molecular weight of about 35.844 daltons and a pi of about 5.45. Further analysis reveals 
a signal peptide from about amino acid residue 1 to about residue 26, glycosaminoglycan attachment sites at 
about amino acid residues 131-135, cAMP- and cGMP-dependent protein kinase phosphorylation sites at about 
5 amino acid residues 144-148 and N-myristoylation sites at amino acid residues 26-32, 74-80, 132-138, 134- 
140, 190-196, 287-293 and 290-296. A cDNA clone containing DNA53971 (SEQ ID NO:257) has been 
deposited with the ATCC on April 6, 1998 and is assigned deposit no. 209750. 

BB. Isolation of cDNA clones Encoding Human PRO840 (UNQ433) 

10 The use of a yeast screen procedure on tissue isolated from a human thyroid library resulted in an EST 

sequence which served as the template for the creation of PCR oligonucleotides which ultimately resulted in 
the isolation of DNA53987 (SEQ ID NO:266, Figure 105) and the derived PRO840 native sequence protein 
UNQ433 (SEQ ID NO:267, Figure 106). 

A nucleotide sequence encoding a polypeptide of substantial homology with UNQ433 (SEQ ID 

15 NO:267) of Figure 106 is also available from GenBank as accession number HEEPSSARC_l. 

DNA53987 (SEQ ID NO:266) as shown in Figure 105 contains an open reading frame with a 
translation initiation site at about nucleotide residues 18-20 and ending at the stop codon (TGA) at nucleotide 
residues 1329-133 1 . as indicated by bolded underline. The second methionine codon at nucleotide residues 90- 
92 could possibly also be the actual translation initiation site - alternatively, this codes for an internal 

20 methionine. The predicted PRO840 polypeptide (Le.. the longer translation) has been termed UNQ433 (SEQ 
ID N'0:267) and is 437 amino acids long (Figure i06), has a calculated molecular weight of 49,851 daitons and 
a pi of 6.47. 

A cDNA clone containing DNA53987 (SEQ ID NO:266) has been deposited with the ATCC on May 
12, 1998 under ATCC deposit number 209858. 
25 Analysis of the UNQ433 polypeptide of Figure 106 (SEQ ID NO:267) reveals a signal sequence at 

about amino acid residues 1-46. a transmembrane domain at about amino acid residues 319-338. an N- 
giycosylation site at about residues 200-204, a cAMP and cGMP-dcpendent protein kinase phosphorylation 
sites at amino acid residues 23-27, tyrosine kinase phosphorylation sites at amino acid residues 43-52 and N- 
myristolylauon sites at residues 17-23, 1 12-118, 1 16-122 and 185-191. 

30 

BC. Isolation of cDNA clones Encoding Human PRQ982 (UNQ483) 

Use of the signal algorithm procedure described above resulted in the identification of an EST cluster 
sequence no. 43715. This sequence was then compared to a variety of various EST databases as described 
under the signal algorithm procedure above, and further resulted in the identification of Merck EST No. 
35 AA024389. The full-length clone corresponding to this EST resulted in the identification of the fall-length 
sequence DNA57700 (Fig. 107, SEQ ID NO:268) and the derived PR0982 native sequence protein UNQ483 
(Fig. I08,SEQIDNO:269). 

The DNA57700 sequence of Figure 107 (SEQ ID NO:268) contains a single open reading frame with 
an apparent translation initiation site at nucleotide positions 26-28 and ending at the stop codon (TAA) found at 
40 nucleotide positions 401-403, as indicated by bolded underline. The prediced PR0982 polypeptide precursor 
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(/.<?., UNQ982, SEQ ID NO:191) is 124 amino acids in length, has a calculated molecular weignt ot 
approximately 14,198 daltons and an estimated pi of approximately 9.01 (Fig. 108). Further analysis of the 
UNQ483 (SEQ ID NO:269) polypeptide of Figure 108 reveals a signal peptide from about amino acid residues 
I to about 21 and potential anaphylatoxin domain from about amino acid residue I to about residue 59. A 
cDNA clone containing DNA57700 (SEQ ID NO:268) was deposited with the ATCC on January 12, 1999 and 
is assigned ATCC deposit No. 203583. 

BD. Isolation of cDNA clones Encoding Human PRQ836 (UNQ545) 

Use of the signal algorithm procedure described above resulted in the identification of EST clusters 
which were then compared to a variety of various EST databases as described under the signal algorithm 
procedure above, and further resulted in the identification of Incyte EST 2610075, an EST derived from colon 
tumor tissue. The full-length clone corresponding to this EST resulted in the identification of the full-length 
sequence DNA59620 (Fig. 109, SEQ ID NO:270) and- the derived PR0836 native sequence protein UNQ545 
(Fig. U0,SEQIDNO:27l). 

The nucleotide sequence DNA59620 (SEQ ID NO:270) shown in Figure 109 contains a single open 
reading frame with an apparent translation initiation site at nucleotide positions 65-67 and ending at the stop 
codon (TGA) at nucleotide positions 1448-1450 (Fig. 109), as indicated by bolded underline. The predicted 
PR0836 polypeptide precursor (U>.. UNQ545, Fig. 110, SEQ ID NO:271) is 461 amino acids in length. 
UNQ545 (SEQ ID NO:27I) shown in Figure 1 10 has an estimated molecular weight of about 52,085 daltons 
and a pi of about 5.36. Further analysis reveals a signal peptide at about amino acid residues 1 to about 29, N- 
°' J — J' — *" .wiuuw iyj ami zjo ana N-mynstoyiaaon sites at aoout resiaues 

19, 234. 25 U 402 and 451, a domain conserved in the YJL126w/YLR35lc/yhcX family of proteins at about 
amino acid residues 364 to about 372, and a region having sequence identity with SLS 1 protein at about amino 
acid residues 68 to about 340. 

A cDNA clone containing DNA59620 (SEQ ID NO:270) has been deposited with the ATCC on 16 June 1998 
and is assigned deposit number 209989. 

BE. Isolation of cDNA clones Encodine Human PROi 159 (UNQ589) 

Use of the signal algorithm procedure described above resulted in the identification of EST cluster 
sequence 77245, which was then compared to a variety of various EST databases as described under the signal 
algorithm procedure above, and further resulted in the identification of Incyte EST no. 376776. Analysis of the 
full-length clone corresponding to this EST resulted in the identification of the full-length sequence 
DNA60627 (Fig. 1 1 1, SEQ ID NO:272) and the derived PROI 159 native sequence protein UNQ589 (Fig. 1 12, 
SEQ ID NO:273). 

Clone DNA60627 (SEQ ID NO:272) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 92-94 and ending at the stop codon (TAG) at nucleotide 
positions 362-364 (Figure 111), as indicated by bolded underline. The predicted PROI 159 polypeptide 
precursor (*.e., UNQ589, SEQ ID NO:273) is 90 arnino acids long (Figure 112). The UNQ589 (SEQ ID 
NO:273) protein shown in Figure 1 12 has an estimated molecular weight of about 9,840 daltons and a pi of 
about 10.13. 
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Analysis of the UNQ589 (SEQ ID NO:273) sequence shown in Figure 1 12 evidences the presence ot 
the following: a signal peptide from about amino acid residue I to about residue 15 and a potential N- 
giycosylation site at about amino acid residue 38. Clone DNA60627 (SEQ ID NO:272) has been deposited 
with ATCC on August 4, 1998 and is assigned ATCC deposit no. 203092. 

5 

BF. Isolation of cDNA clones Encoding Human PRQ1358 (UNQ707) 

Use of the signal algorithm procedure described above resulted in the identification of an EST cluster 
sequence, which was then compared to a variety of various EST databases as described under the signal 
algorithm procedure above, and further resulted in the identification of Incyte EST 0887 1 8, a fragment derived 

10 from a liver tissue library. Analysis of the full-length clone corresponding to the EST resulted in the 
identification of the full-length sequence DNA64890 (Fig. 1 13, SEQ ID NO:274) and the derived PR01358 
native sequence protein UNQ707 (Fig. 1 14, SEQ ID NO:275). 

The DNA64890 (SEQ ID NO:274) clone shown in Figure 113 contains a single open reading frame 
with an apparent translation initiation site at nucleotide positions 86 through 88 and ending at the stop codon 

15 (TAA) found at nucleotide positions 1418 through 1420 (Figure 1 13). as indicated by bolded underline. The 
predicted PRO 1 358 polypeptide precursor (i.e.. LTNQ707, SEQ ID NO:275) is 444 amino acids long, and a 
signal peptide is at about amino acid residues 1-18. UNQ707 (SEQ ID NO:275) has a calculated molecular 
weight of approximately 50719 daltons and an estimated pi of approximately 8.82. A cDNA clone containing 
DNA64890 (SEQ ID NO:274). designated as DNA64890-I612, was deposited with the ATCC on August 18, 

20 1 998 and is assigned ATCC deposit no. 203 1 3 1 . 

BG. Isolation of cDNA clones Encoding Human PRO 1325 (UNQ685) 

Use of the signal algorithm procedure described above resulted in the identification of the EST cluster 
sequence no. 139524, which was then compared to a variety of various EST databases as described under the 
25 signal algorithm procedure above, and further resulted in the identification of Incyte EST 3744079. Analysis 
of the full-length clone corresponding to the EST resulted in the identification of the full-length sequence 
DNA66659 (Fig. 1 15, SEQ ID NO:276) and the derived PROI325 native sequence protein UNQ685 (Fig. 1 16, 
SEQ ID NO:277). 

Clone DNA66659 (Fig. 1 15. SEQ ID NO:276) contains a single open reading frame with an apparent 
30 translation initiation site at nucleotide positions 51-53 and ending at the stop codon (TAG) at nucleotide 
positions 2547-2549, as indicated by bolded underline. The predicted PR01325 polypeptide precursor (i.e.. 
UNQ685, SEQ ID NO:227) is 832 amino acids long. The UNQ685 (SEQ ID NO:227) protein shown in Figure 
1 16 has an estimated molecular weight of about 94,454 daltons and a pi of about 6.94. Further analysis of 
UNQ685 (SEQ IDNO:227) reveals: a signal peptide from about amino acid I to about amino acid 18, 
35 transmembrane domains from about amino acid 292 to about amino acid 317, from about amino acid 451 to 
about amino acid 470, from about amino acid 501 to about amino acid 520, from about amino acid 607 to 
about amino acid 627 from about amino acid 751 to about amino acid 770, a leucine zipper pattern sequence 
from about amino acid 497 to about amino acid 518 and potential N-glycosylation sites from about amino acid 
27 to about amino acid 30, from about amino acid 54 to about amino acid 57, from about amino acid 60 to 
40 about amino acid 63, from about amino acid position 123 to about amino acid position 126, from about amino 
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acid position 141 to about amino acid position 144, from about amino acid position 165 to about amino acid 
position 168, from about amino acid position 364 to about amino acid position 367, from about amino acid 
position 476 to about amino acid position 479, from about amino acid position 496 to about amino acid 
position 499, from about amino acid position 572 to about amino acid position 575, from about amino acid 
5 position 603 to about amino acid position 606 and from about amino acid position 699 to about amino acid 
position 702. A cDNA clone containing DNA66659 (SEQ ID NO;276)) has been deposited with ATCC on 
September 22, 1998 and is assigned ATCC deposit no. 203269. 

. BH. Isolation of cDNA clones Encoding Human PRO 1338 (UNQ693) 

10 The use of yeast screens resulted in EST sequences which were then compared to various public and 

private EST databases in a manner similar to that described above under ECD homology resulted in the 
identification of Incyte EST2615184, an EST derived from cholecystitis gall bladder tissue. Analysis of the 
corresponding full-length sequence ultimately resulted in the isolation of DNA66667 (SEQ ID NO:278, Figure 
1 17) and the derived PRO 1 338 native sequence protein UNQ693 (SEQ ID NO:279, Figure 1 18). 

15 DNA66667 (SEQ ID NO:278) as shown in Figure 1 17 contains, a single open reading frame with a 

translation initiation site at about nucleotide residues 115-117 and ending at the stop codon (TAA) at 
nucleotide positions 2263-2265. as indicated by bolded underline. The predicted PRO 1 338 polypeptide 
precursor (i.e.. UNQ693. SEQ ID NO:ll8) is 716 amino acids in length (Figure 118), has a calculated 
molecular weight of 80.716 daltons and a pi of 6.06. 

20 Analysis of ihe UNQ693 polypeptide (SEQ ID NO:278) of Figure 118 reveals a signal sequence at 

about amino acid residues I to 25, a transmembrane domain at about amino acid residues 629-648, N- 
glycosylation sites at about amino acid residues 69-73, 96-100, 106-1 10, 1 17-121, 385-389, 517-521, 582-586 
and 611-615, a tyrosine kinase phosphorylation site at about residues 573-582 and N-myristoylation sites at 
about amino acid residues 1 6-22, 224-230. 464-470, 637-643 and 698-704. 

25 A cDNA containing DNA66667 (SEQ ID NO:278) has been deposited with the ATCC under the 

designation DNA66667-1596 on September 22. 1998 and has been assigned ATCC deposit number 203267. 

BI. Isolation of cDNA clones Encoding Human PRO 1434 (UNQ739) 

Use of ECD homology procedure described above in a human retina tissue library resulted in the 
30 identification of the full-length DNA sequence DNA68818 (Fig. 119. SEQ ID NO:280) and the derived 
PRO 1434 native sequence protein UNQ739 (Fig. 120, SEQ ID NO:28I). 

The PCR primers (forward and reverse) and hybridization probe synthesized in this procedure were 
the following: 
forward PCR primer 

35 5*-G AGGTGTCGCTGTGAAGCCAACGG-3* (SEQ ID NO:282) 

reverse PCR primer 

5 , ^GCTCGATTCTCCATGTGCCTTCC-3' (SEQ ID NO:283) 

hybridization probe : (SEQ ID NO:284) 

5 , -GACGGAGTGTGTGGACCCTGTGTACGAGCCTGATCAGTGCTGTCC-3' 
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Clone DNA68818 (SEQ ID NO:280) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 581-583 and ending at the stop codon (TAG) at nucleotide 
positions 1556-1558 (Figure 119), as indicated by bolded underline. The predicted PR01434 polypeptide 
precursor (Le.. UNQ739, SEQ ID NO:28l) is 325 amino acids long (Figure 120). The UNQ739 (SEQ ID 
NO:281) protein shown in Figure 120 has an estimated molecular weight of about 35,296 daitons and a pi of 
about 5.37. Further analysis reveals a signal sequence at about amino acid residues 1-27, a giycosaminogiycan 
attachment.site at about amino acid residues 80-84 ; M-myristoylation sites at about amino acid residues 10-16, 
102-108, 103-109, a ceil attachment sequence at about amino acid residues 1 14-1 17 and an EGF-Iike domain 
cysteine pattern signature at about amino acid residues 176-188. 

A clone containing DNA68818 (SEQ ID NO:280) has been deposited with ATCC under the 
designation DNA688 18-2536 on February 9, 1999 and is assigned ATCC deposit no. 203657. 



BJ. Isolation of cDNA clones Encoding Human PRQ4333 (UNO 1888) 

An expressed sequence tag (EST) DNA database (LIFESEQ*. Incyte Pharmaceuticals, Palo Alto, CA) 
1 5 was searched in a manner similar to that described above under the ECD homology procedure described above 
and an EST was identified which showed homology to lymphotoxin-beta receptor. 

The EST served as the template to create oligonucleotide primers and probes to screen a human fetal 
kidney library in a manner similar to that described above under the ECD homology procedure. 
The oligonucleotides created for the above procedure were the following: 
20 forward PCR primer: (SE q id N 0:287) 

5'-GCAAGAATTCAGGGATCGGTCTGG-3' 

P robe: (SEQ ID NO:288) 

5 , -CTGTGTTCCCTGCAACCAGTGTGGGCCAGGCATGG AGTTGTCTAAGG-3* 

reversc: (SEQ ID NO:289) 

25 5'-AGATGCCATCACTG GTGGCTGAAC-3' 

fonvard: (SEQ ID NO:290) 

5'-CAGAAGGCAAATTGTTCAGCCACCAG-3' 

revcrse: (SEQIDNO:291) 
5 , -ACAGTTTCCAGACCGATCCCTGAATTC-3* 

30 The result was the isolation of the full-length DNA sequence DNA842I0 (SEQ ID NO:285, Figure 

121). The DNA84210 (SEQ ID NO:285) clone depicted in Figure 121 contains a single open reading frame 
with an apparent translation initiation site at nucleotide positions 185-187, and a stop codon (TAA) at 
nucleotide positions 1436-1438, as indicated by bolded underline. The predicted PR04333 polypeptide 
precursor (/>.. UNQI888, SEQ ID NO:286) is 417 amino acids long. The UNQ1888 protein (SEQ ID 

35 NO:286) shown in Figure 121 has an estimated molecular weight of about 45305 daitons and a pi of about 
5.12. 

Analysis of the UNQ1888 polypeptide (SEQ ID NO:286) of Figure 121 reveals a signal peptide at 
about amino acid residues 1-25, a transmembrane domain at about residues 169-192, N-glycosylation sites 
about residues 105-109, 214-218, 319-323, 350-354, 368-372, 379-383, cAMP- and cGMP-dependent protein 
40 kinase phosphorylation sites at about residues 200-204 and 238-242, a tyrosine kinase phosphorylation site at 
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about residues 207-214, an N-myristoylation site at about residues 55-61, 215-218 and 270-276, a prokaryotic 
membrane lipoprotein lipid attachment site at about residues 259-270 and a TNFR/NGFR family cysteine-rich 
region at about residues 89-96. 

A cDNA clone containing DNA84210 (SEQ ID NO:285), designated as DNA842 10-2576, has been 
5 deposited with ATCC on March 2, 1999 and is assigned ATCC deposit no. 203818. 

BK. Isolation of cDNA clones Encoding Human PRO4302 (UNO 1866) 

Use of the amylase screen procedure described above on tissue isolated from human tissue resulted in 
an EST sequence which was then compared against various EST databases to create a consensus sequence by a 

10 methodology as described above under the amylase yeast screen procedure and/or the ECD homology 
procedure. Further analysis of this consensus sequence resulted in the identification of Incyte EST no. 
2408081 HI. Analysis of the full-length clones corresponding to EST no. 240808 IH1 resulted in the isolation 
of the full length native sequence clones DNA922I8 (SEQ ID NO:292) and the derived PRO4302 full-length 
native sequence protein UNQ1866 (SEQ ID NO:293). 

15 ^ fo 11 length clone DNA92218 (SEQ ID NO:292) shown in Figure 123 has a single open reading 

frame with an apparent translaiional initiation site at nucleotide positions 174-176 and a stop signal (TAG) at 
nucleotide positions 768-770, as indicated by bolded underline. The predicted PRO4302 polypeptide precursor 
(i.e.. UNQ1866. SEQ ID NO:293) is 198 amino acids long, has a calculated molecular weight of approximately 
22,285 daltons and an estimated pi of approximately 9.35. Analysis of UNQ1866 (Fig. 124, SEQ ID NO:293) 

20 reveals a signal peptide from about amino acid residue 1 to about residue 23, a transmembrane domain from 
about ammo acid residue 111 to about residue 130, a cAMP and cGMP-dependent protein kinase 
phosphorylation sites at residues 26-30. casein kinase II phosphorylation sites at residues 44-47 and 58-61, a 
tyrosine kinase phosphorylation site at residues 36-43 and N-myristoylation sites at residues 124-130. 144-150 
and 189-195. 

25 A cDNA clone containing DNA92218 (SEQ ID NO:292), designated DNA922 1 8-2554, was 

deposited with the ATCC on March 9. 1999 and has been assigned deposit number 203834. 

BL. Isolation of cDNA clones Encoding Human PRO4430 (UNQ1947) 

Use of the signal algorithm procedure described above resulted in the identification of an EST cluster 

30 sequence, which was then compared to a variety of various EST databases as described under the signal 
algorithm procedure above, and further resulted in the identification of a consensus sequence. Further analysis 
of the consensus sequence resulted in the identification of the full-length sequence DNA96878 (Fig. 125, SEQ 
ID NO:294) and the derived PRO4430 native sequence protein UNQ1947 (Fig. 126, SEQ ID.NO:295). 

The native sequence DNA sequence DNA96878 (SEQ ID NO:294) shown in Figure 125 contains a 

35 single open reading frame with an apparent translation initiation site at nucleotide positions 56-58 and ending 
at the stop codon (TGA) found at nucleotide positions 431-433, as indicated by bolded underline. The 
predicted PRO4430 polypeptide precursor (UNQ1947, Fig. 126, SEQ ID NO:295) is 125 amino acids long. 
The UNQ4430 protein (SEQ ID NO:295) of Figure 126 has a calculated molecular weight of approximately 
13821 daltons and an estimated pi of approximately 8.6. Further analysis reveals the presence of a signal 

40 sequence at about amino acid residues 1 to about 18, N-glycosylation sites at about residues 77-80 and again at 
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about residues 88-91, a casein kinase II phosphorylation site at about residues 67-70, an N-myristoylation site 
at about residues 84-89 and a Lys-6/u-PAR domain at about residues 85-98. 

A clone containing DNA96878 (SEQ ID NO:294), designated DNA96878-2626, was deposited with 
the ATCC on May 4, 1 999 and is assigned ATCC deposit no. 23-PTA. 

BM. Isolation of cDNA clones Encoding Human PRQ5727 (UNQ2448) 

Various known TNF-receptors were used to screen public and private EST databases (e.g.. see ECD 
homology procedure, above) resulting in the identification Incyte clone 509151 1H. This EST sequence, which 
was derived from uterine tumor tissue, then served as a template for the construction of the cloning oligos 
indicated below which were then used to identify by PCR a human thymus cDNA library that contamed the 
sequence of interest. These oligonucleotides were: 
Forward primer (509-1): 

5 '-GAGGGGGCTGGGTG AG ATGTG-3 * (S EQ ID NO:298) 

Reverse primer (509-4AS): 

5'-TGCTTTTGTACCTGCGAGGAGG-3' {S £Q ID NO:299) 

To isolate the DNA sequence encoding the full-length DNA98853 polypeptide, an inverse long 
distance PCR procedure was carried out (Figure 129). The PCR primers generally ranged from 20 to 30 
nucleotides. For inverse long distance PCR, primer pairs were designed in such a way that the 5* to 3* direction 
of each primer pointed away from each other. 

A pair of inverse long distance PCR primers for cloning DNA98853 were synthesized: 
Primer 1 (left primer) (509-P5): 

5 , -pCATGGTGGGAAGGCCGGTAACG-3' {SE q id NO: 300) 

Primer 2 (right primer) (509-P6): 

5'-pGATTGCCAAGAAAATGAGTACTGGGACC-3' (SEQ ID NO:301 ) 

In the inverse long distance PCR reaction, the template is the plasmid cDNA library. As a result, the 
PCR products contain the entire vector sequence in the middle with insert sequences of interest at both ends. 
After the PCR reaction, the PCR mixture was treated with Dpn I which digests only the template plasmids, 
followed by agarose gel purification of PCR products of larger than the size of the library cloning vector. 
Since the primers used in the inverse long distance PCR were also 5'-phosphoryIated. the purified products 
were then self-ligated and transformed into E.coli competent cells. Colonies were screened by PCR using 5* 
vector primer and proper gene specific primer to identify clones with larger 5' sequence. Plasmids prepared 
from positive clones were sequenced. If necessary, the process could be repeated to obtain more 5' sequences 
based on new sequence obtained from the previous round. 

The purpose of inverse long distance PCR is to obtain the complete sequence of the gene of interest. 
The clone containing the full length coding region was then obtained by conventional PCR. 

The primer pair used to clone the full length coding region of DNA98853 (SEQ ID NO:296) were the 
following: 

Forward primer (Cla-MD-509): 

5 , -GGAGGATCGATACCATGGATTGCCAAGAAAATGAG-3' (SEQ ID NO:302) 

Reverse primer (509.TAA.not): 
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S^GGAGGAGCGGCCGCTTAAGGGCTCGGAACTTCAAAGGGCACO' (SEQ ID NO:303) 

For cloning purposes, a Cla I site and a Not I site were included in the forward primer and reverse 
primer respectively. 

To ensure the accuracy of the PCR products, independent PCR reactions were performed and several 

cloned products were sequenced. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
DNA98853 (SEQ ID NO:296, Figure 127) and the derived PR05727 native sequence protein UNQ2448 (SEQ 
ID NO:297, Figure 128). 

Clone DNA988S3 (SEQ ID NO:296) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 1-3 and ending at the stop codon (TAA) at nucleotide positions 
901-903 (Figure 127), as indicated by bo Ided underline. The predicted PROS727 polypeptide precursor (i.e.. 
UNQ2448, SEQ ID NO:297) is 299 amino acids long (Figure 128), has a calculated molecular weight of 
32,929 daltons and a pi of 4.95. The UNQ2448 polypeptide (SEQ ID NO:297) shown in Figure 128 has an 
estimated molecular weight of about 3.3 kilodaltons and a pi of about 4.72. A potential N-glycosylation site 
exists between amino acids 74 and 77 of the ammo acid sequence shown in Figure 128. A potential N- 
myristoylation site exists between amino acids 24 and 29 of the amino acid sequence shown in Figure 128. 
Potential casein kinase II phosphorylation sites exist between amino acids 123-126, 185-188. 200-203, 252- 
255. 257-260, 271-274. and 283-286 of the amino acid sequence shown in Figure 128. A potential 
transmembrane domain exists between amino acids 137 to 158 of the sequence shown in Figure 128. It is 
presently believed that the polypeptide does not include a signal sequence. 

A cDNA clone containing DNA98853 (SEQ ID NO:296. designated DNA98853-1739, has been 
deposited with ATCC on April 6, 1999 and is assigned ATTC Deposit No. April 6, 1 999. 

EXAMPLE 2 

Stimulatory Activity in Mixe d Lvmphocvte Reaction (MLR) Assay (no.24) 
This example shows that the polypeptides of the invention are active as a stimulator of the 
proliferation of stimulated T- lymphocytes. Compounds which stimulate proliferation of lymphocytes are 
useful therapeutically where enhancement of an immune response is beneficial. A therapeutic agent may take 
the form of antagonists of the polypeptide of the invention, for example, murine-human chimeric, humanized 
or human antibodies against the polypeptide. 

The basic protocol for this assay is described in Current Protocols in Immunology, unit 3.12; edited 
by J. E. Coligan, A. M. Kruisbeek. D. H. Marglies, E. M. Shevach. W. Strober, National Institutes of Health, 
Published by John Wiley & Sons, Inc. 

More specifically, in one assay variant, peripheral blood mononuclear ceils (PBMC) are isolated from 
mammalian individuals, for example a human volunteer, by leukopheresis (one donor will supply stimulator 
PBMCs, the other donor will supply responder PBMCs). If desired, the cells are frozen in fetal bovine serum 
and DMSO after isolation. Frozen cells may be thawed overnight in assay media (37°C, 5% C0 2 )and then 
washed and ^suspended to 3 x 10* cells/ml of assay media (RPMI; 10% fetal bovine serum, 1% 
penicillin/streptomycin, 1% glutamine, 1% HEPES, 1% non-essential amino acids, 1% pyruvate). 
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The stimulator PBMCs are prepared by irradiating the ceils (about 3000 Rads). The assay is prepared 
by plating in triplicate wells a mixture of: lOOul of test sample diluted to 1% or to 0.1%; 50 uJ of irradiated 
stimulator cells and 50 ul of responder PBMC cells. 100 microliters of cell culture media or 100 microliter of 
CD4-IgG is used as the control. The wells are then incubated at 37°C, 5% C0 2 for 4 days. On day 5 and each 
5 well is pulsed with tritiated thymidine (1.0 mC/well: Amersham). After 6 hours the cells are washed 3 times 
and then the uptake of the label is evaluated. 

In another variant of this assay, PBMCs are isolated from the spleens of Balb/c mice and C57B6 mice. 
The cells are teased from freshly harvested spleens in assay media (RPMI;10% fetal bovine serum, 1% 
penicillin/streptomycin, 1% glutamine, 1% HEPES, 1% non-essential amino acids. 1% pyruvate) and the 

10 PBMCs are isolated by overlaying these cells over Lympholyte M (Organon Teknika), centrifuging at 2000 
rpm for 20 minutes, collecting and washing the mononuclear cell layer in assay media and resuspending the 
cells to Ix I0 7 cells/ml of assay media. The assay is then conducted as described above. The results of this 
assay for compounds of the invention are shown below. Positive increases over control are considered positive 
with increases of greater than or equal to 180% being preferred. However, any value greater than control 

1 5 indicates a stimulatory effect for the test protein. 

Table 7 



PRO 


PRO Concentration 


Percent Increase Over C 


PR0356 


0.1% 


133.8 


PR0356 


0.1% 


208.9 


PR0356 


1.0% 


251.6 


PR0356 


1.0% 


332.1 


PR0273 


12.4 nM 


112 


PR0273 


124 nM 


192.7 


PR0769 


23.86 nM 


76.3 


PR0769 


238.6 nM 


226 


PRO 1 184 


16.88 nM 


81.6 


PRO 1184 


168.82 nM 


194.4 


PR01346 


3.34 nM 


86.6 


PR01346 


33.41 nM 


188.5 


PRO 1246 


0.07 nM 


145 


PR01246 


0.7 nM 


180.9 


PR0269 


0.1% 


122.4 


PR0269 


1% 


194.1 


PR0344 


0.1% 


148.6 


PR0344 


1% 


259.9 


PR0333 


0.1% 


187.8 


PR0333 


1% 


220 


PR0381 


14.5 nM 


87.3 


PR038I 


14.5 nM 


135.4 
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PR0381 


145 nM 


248.1 


PR0381 


145 nM 


290.8 


PR0533 


0.06 nM 


163 


PR0533 


0.61 nM 


382.9 


PRO720 


0.1 nM 


198.4 


PRO720 


l.OnM 


293.5 


PR0866 


0.1 nM 


131.8 


PR0866 


1.04 nM 


223 2 



10 EXAMPLE 3 

Hairless Guinea pig Proinflammatory Assay (no. 32) 
This assay is designed to determine whether the PRO polypeptides show the ability to induce vascular 
permeability. Polypeptides testing positive in this assay are expected to be useful for the therapeutic treatment 
of conditions which would benefit from enhanced vascular permeability including, for example, conditions 

1 5 which may benefit from enhanced local immune system cell infiltration. 

Hairless guinea pigs weighing 350 grams or more were anesthetized with Ketamine (75-80 mg/kg) 
and 5 mg/kg Xyiazine intramuscularly. Test samples containing the PRO polypeptide or a physiological buffer 
without the test polypeptide are injected into skin on the back of the test animals with 100 ul per injection site 
intradermally. There were approximately 16-24 injection sites per animal. One mi of Evans blue dye (1% in 

20 PBS) is then injected intracardially. Skin vascular permeability responses to the compounds (i.e.. blemishes at 
the injection sites of injection) are visually scored by measuring the diameter (in mm) of blue-colored leaks 
from the site of injection at I, 6 and/or 24 hours post administration of the test materials. The mm diameter of 
blueness at the site of injection is observed and recorded as well as the severity of the vascular leakage for 
values scoring above 4 standard deviations over the same animal control. Blemishes of at least 5 mm in 

25 diameter are considered positive for the assay when testing purified proteins, being indicative of the ability to 
induce vascular leakage or permeability. A response greater than 7 mm diameter is considered positive for 
conditioned media samples. Human VEGF is used as a positive control, inducing a response of 4-8 mm 
diameter at 0. 1 jig/ 100 ul, and 15-23 mm diam. at I ug/100 ul. 

The tested polypeptide are diluted to 1% of the initial stock solution. UNQ 585 was diluted into 10 

30 mM HEPES/140 mM NaCl/4% mannitol/1 mg/ml BSA pH 6.8, while UNQ334 was diluted into 140 mM 
NaCl 10 mM Hepes, 4% Mannitol pH,7.4. 

Table 8 

UNQ polypeptide Stock solution concentration Time (hr) dialation (mm) 

PROU55 20,384 nM 1 6 

35 PR01155 20,384 nM 6 6 

PR0533 1024 nM I 5.4 

PR0533 1024 nM 6 7 

PR021 22,000 nM 1 2.0 

PR021 22,000 nM 6 14.0 

40 
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EXAMPLE 4 
Skin Vascular Permeability Assay fno.64) 

This assay shows that certain PRO polypeptides stimulate an immune response and induce 
inflammation by inducing mononuclear ceil, eosinophil and PMN infiltration at the site of injection of the 
animal. This skin vascular permeability assay is conducted as follows. Hairless guinea pigs weighing 350 
grams or more are anesthetized with ketamine (75-80 mg/Kg) and 5 rag/Kg Xyiazine intramuscularly (IM). A 
sample of purified PRO polypeptide or a conditioned media test sample is injected intradermally onto the backs 
of the test animaJs with 100 uL per injection site. It is possible to have about 10-30, preferably about 16-24, 
injection sites per animal. One mL of Evans blue dye (1% in physiologic buffered saline) is injected 
intracardially. Blemishes at the injection sites arc then measured (mm diameter) at lhr, 6 hrs and 24 hrs post 
injection. Animals were sacrificed at 6 hrs after injection. Each skin injection site is biopsied and fixed in 
paraformaldehyde. The skins are then prepared for histopathalogic evaluation.- Each site is evaluated for 
inflammatory ceil infiltration into the skin. Sites with visible inflammatory ceil inflammation are scored as 
positive. Inflammatory cells may be neutrophilic, eosinophilic, monocytic or lymphocytic 

At least a minimal perivascular infiltrate at the injection site is scored as positive, no infiltrate at the 
site of injection is scored as negative. 



Table 9 



UNO 


Time (hrs) 


Infiltrate 


PRO 172 


24 


positive 


PRO200 


24 


positive 


PRO200 


24 


positive 


PR0216 


24 


positive 


PR0272 


24 


positive 


PR0362 


24 


positive 


PRO 1 007 


24 


positive 


PRO 1031 


24 


positive 


PR01283 


24 


positive 


PRO 1343 


24 


positive 


PR01358 


6 


positive 




PR01325 


6 


positive 


PRO 1434 


24 


positive 


PR04333 


6 


positive 




EXAMPLE 5 






Inhibitory Activity in Mixed Lvmohocvte Reaction (MLR) Assav (no 


67) 



Ibis example shows that one or more of the PRO polypeptides are active as inhibitors of the 
proliferation of stimulated T-lymphocytes. Compounds which inhibit proliferation of lymphocytes are useful 
therapeutically where suppression of an immune response is beneficial. 
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The basic protocol for this assay is described in Current Protocols in Immunology, unit 3.12; edited 
by J. E. Coligan, A. M. Kruisbeek, D. H. Marglies, E. M. Shevach, W. Strober, National Institutes of Health, 
Published by John Wiley & Sons, Inc. 

More specifically, .in one assay variant, peripheral blood mononuclear ceils (PBMC) are isolated from 
mammalian individuals, for example a human volunteer, by leukopheresis (one donor will supply stimulator 
PBMCs, the other donor will supply responder PBMCs). If desired, the cells are frozen in fetal bovine serum 
and DMSO after isolation. Frozen cells may be thawed overnight in assay media (37°C, 5% CO,) and then 
washed and resuspended to 3xl0 6 cells/ml of assay media (RPMI; 10% fetal bovine serum, 1% 
penicillin/streptomycin, 1% glutamine, 1% HEPES, 1% non-essential amino acids, 1% pyruvate). The 
stimulator PBMCs are prepared by irradiating the cells (about 3000 Rads). 
The assay is prepared by plating in triplicate wells a mixture of: 
100:1 of test sample diluted to 1% or to 0. 1%, 
50 :! of irradiated stimulator cells, and 
50 :1 of responder PBMC ceils. 
100 microliters of ceil culture media or 100 microliter of CD4-IgG is used as the control. The wells are then 
incubated at 37°C, 5% C0 2 for 4 days. On day 5, each well is pulsed with tritiated thymidine (1.0 mC/well; 
Amersham). After 6 hours the cells are washed 3 times and then the uptake of the label is evaluated. 

In another variant of this assay, PBMCs are isolated from the spleens of Balb/c mice and C57B6 mice. 
The cells are teased from freshly harvested spleens in assay media (RPMI; 10% fetal bovine serum, 1% 
penicillin/streptomycin. 1% glutamine, 1% HEPES, 1% non-essential amino acids, 1% pyruvate) and the 
PBMCs are isolated by overlaying these cells over Lympholyte M (Organon Teknika), centrifoging at 2000 
rpm for 20 minutes, collecting and washing the mononuclear cell layer in assay media and resuspending the 
cells to lxlO 7 ceils/mi of assay media. The assay is then conducted as described above. 

Any decreases below control is considered to be a positive result for an inhibitory compound, with 
decreases of less than or equal to 80% being preferred. However, any value less than control indicates an 
inhibitory effect for the test protein. 



PRO 

PRO204 

PRO204 

PR0212 

PR0212 

PR02I2 

PR0212 

PR0212 

PR0212 

PR0212 

PR0212 

PR0212 



Table 10 

PRO Concentration Percent Decrease Below Control 

0.1% 86 

1.0% 35 

0.59 nM 0 

5.9 nM 52.6 

0.87 nM 82.7 

8.7 nM 66 

1.9 nM 81.6 

19 nM 61J 

0.46 nM 66.1 

4.6 nM 59 J 

2.1 nM o 
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PRO?!'? 


IA nM 


116.2 




OM 


0 


PR fY? 1 7 


nM 


62.2 


PRr^lA 
rlxL/^ 1 u 


A 1 ^ — li 

0.13 nM 


74.3 


r m/z 1 o 


1.3 nM 


63.3 




0A2 nM 


67.9 




1.2 nM 


40.6 




0 nM 


83.6 




0.02 nM 


69.7 




5.3 nM 


68.2 




53 nM 


68.2 




35 nM 


72.2 




350 nM 


64 


pp 


19.1 nM 


53 




191 nM 


54 




0.93 nM 


71.8 




0.93 nM 


80.9 




9.3 nM 


49.6 




9.3 nM 


51.9 


PKL?273 


3L46n_M 


81 


rKUzVJ 


3 14.56 nM 


67 




0.35 nM 


. 74.2 


rKUiJz 


3.5 nM 


68 


PRQ332 


0.35 nM 


20.2 




3.5 nM 


61.2 


rKUJoi 


1.5 nM 


63.2 




15 nM 


64.7 




8.6 nM 


76.9 




86 nM 


63.6 




8.6 nM 


64.4 




86 nM 


2.1 




0.31 nM 


68.1 




3.1 nM 


67.4 


PR m/vd. 


1.7 nM 


92.8 


rK.UJo4 


17 nM 


68.4 


PR0364 


1.7 hM 


94.2 


PR0364 


17 nM 


63.3 


PR0526 


0.12 nM 


68.5 


PR0526 


1.2 nM 


62.5 


PR053I 


0.2 nM 


66.1 
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PR053 1 


7 nM 

Z XUVl 


54.3 




PR053 1 


v.x mvi 


70.4 




PROS 3 1 


7 nM 

z niYi 


68.4 




PRO701 


U. /*r lUVl 


72.5 


5 


PRO70 1 


u. /4 ruvi 


90.2 




PRO70 1 


/.4 I1M 


64.8 




PRO701 


/.4 nM 


69 




PRO770 


ft in 


65.8 




PRO770 


£ ft _\/ 

o.y nM 


67.4 


10 


PR07RS 


iz.yo om 


88.4 




PRH78R 
r i\.w / o o 


lzv.o nM 


57.7 




PR07RS 


z.y nM 


64.4 




PR07RR 


70. M ftvf 

zy nM 


67.4 




ri\\Jouj 


ft n — x ji 
U.z7 nM 


67.9 


i j 


I IVvOOJ 


z./ nM 


63.7 




PROI (\%\ 

kt\\J I I/O J 


7.1 nM 


80.5 




ppni fWl 
r I Uo J 


71 nM 


63.7 




PR HI (S%\ 
rWKJ \ Uo J 


7.1 nM 


40.9 




rKLHUOJ - 


71 nM 


65 , 


Zu 


ppr\i i iii 


0.37 nM 


44.9 




PPPlI ! 1 J 

rK\JI 114 


3.7 nM 


42.4 




rKUl IyZ 


I2.I nM 


31.6 




pon 1 1 07 
rKAJ I 1VZ 


121 nM 


32.6 






0.5 nM 


67 


ZJ 




C —\ A 

5 nM 


66.8 




I I\U 1 Z JU 


ft ft C _ K A 

U.05 nM 


75.4 




PROI 7 


ft C —\ J 

0.3 nM 


57.2 




PROI 7Sft 


ft t\C — h 4 

U.U5 nM 


94.6 




PROI 7 SO 


ft < w\A 

U.j nM 


61.2 


30 




o.j nM 


52 




r rvvj i j i z 


oj nM 


49.3 




pro n 1 2 


14-Z nM 


73.1 






i4z nM 


62.9 




PR012S7 


u.o nM 


79.1 


jj 


di?oi m7 
rivLM jo / 


o nM 


52J 




PRO1410 


4nM 






PRO 14 10 


40 nM 


64.8 




PR01418 


6.4 nM 


67.7 




PR01418 


6.4 nM 


81.1 


40 


PRO 14 18 


64 nM 


56.3 
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pr nidi s 


(LA _ \A 


£.A A 

64.9 


PR Hi RAR 


jy.*f njvi 


65.0 


ppni 

rivw l oOo 


Jy*t nM 


50 


pp n i o 1 7 


7 1 «K>f 

2.1 nM 


70.7 


ppr\ioi7 
rrvvj i y t / 


7 1 «AX 

2.1 nM 


82.5 


ponio 1 7 


"> 1 — Xif 

21 nM 


60.7 


rtsAJ I y I / 


21 nM 


62.6 


por\7n< 


U.7 nM 


71.5 


pp o*>fK 


/ nM 


3.5 


1 KAJohU 


7/* /I «Xj( 

Z4.4 nM 


137.2 




244 nM 


58.9 


rKUoJo 


2.5 nM 


60.7 


rKOSJO 


25 nM 


60.6 


rKUl 1 jy 


1 1.06 nM 


80.4 


PKU1 159 


1 10.55 nM 


57.6 


rROl 159 


11.06 nM 


81.9 


PRO! 159 


1 10.55 nM 
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1.4 nM 
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13.56 nM 
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PRO4302 


135.57 nM 


2.4 
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24.2 nM 


55.9 


PRO4430 


242 nM~ 


49.9 


PR05727 


19.6 nM 


69.2 


PR05727 


196 nM 


54.5 



EXAMPLE 6 
In situ Hybridization 

In situ hybridization is a powerful and versatile technique for the detection and localization of nucleic 
acid sequences within cell or tissue preparations. It may be useful, for example, to identify sites of gene 
expression, analyze the tissue distribution of transcription, identify and localize viral infection, follow changes 
in specific mRNA synthesis and aid in chromosome mapping. 

In situ hybridization was performed following an optimized version of the protocol by Lu and Gillett, 
Cell Vision U 169-176 (1994), using PCR-generated 33 P-Iabeled riboprobes. Briefly, formalin- fixed, 
paraffin-embedded human tissues were sectioned, deparaffinized, deproteinated in proteinase K (20 g/ml) for 
15 minutes at 37°C, and further processed for in situ hybridization as described by Lu and Gillett, supra. A 
[ 33 P] UTP-labeled antisense riboprobe was generated from a PCR product and hybridized at 55°C overnight. 
The slides were dipped in Kodak NTB2 nuclear track emulsion and exposed for 4 weeks. 

33 P-Riboprobe synthesis 
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6.0 Ml (125 mCi) of 33 P-UTP (Amersham BF 1002, SA<2000 Ci/mmol) were speed vac dried. To 
each tube containing dried 33 P-UTP, the following ingredients were added: 2.0 ul 5x transcription buffer; 1.0 
ul DTT (100 mM); 2.0 ui NTP mix (2.5 raM : 10 ul; each of 10 mM GTP, CTP & ATP + 10 ul H 2 0); 1.0 ul 
UTP (50 uM); 1.0 ul Rnasin; 1.0 ul DNA template (lug); 1.0 ul H 2 0. 

The tubes were incubated at 37°C for one hour. 1.0 uL RQi DNase were added, followed by 
incubation at 37°C for 15 minutes. 90 uL TE (10 mM Tris pH 7.6/lmM EDTA pH 8.0) were added, and the 
mixture was pipetted onto DE81 paper. The remaining solution was loaded in a Microcon-50 ultrafiltranon 
unit, and spun using program 10 (6 minutes). The filtration unit was inverted over a second tube and spun 
using program 2 (3 minutes). After the final recovery spin, 100 uL TE were added. I uL of the final product 
was pipetted on DE8 1 paper and counted in 6 ml of Biofluor II. 

The probe was run on a TBE/urea gel. 1-3 uL of the probe or 5 uL of RNA Mrk III were added to 3 
uL of loading buffer. After heating on a 95°C heat block for three minutes, the gel was immediately placed on 
ice. The wells of gel were flushed the sample loaded, and run at 180-250 volts for 45 minutes. The gel was 
wrapped in saran wrap and exposed to XAR Him with an intensifying screen in -70°C freezer one hour to 
overnight. 

33 P-Hybridization 

Pretrcatmcru of frozen sections The slides were removed from the freezer, placed on aluminum 
trays and thawed at room temperature for 5 minutes. The trays were placed in 55 °C incubator for five minutes 
to reduce condensation. The slides were fixed fnr in mimt*** ao/^ „„..„r~ \a~u..j :„ ►u- a.— « 

* — • • >* •««»**»*fcwo 411 -r f <J put UIU1 UUllUSrll^ UU Vil L^V* 111 11 IX* lUiliV. 

hood, and washed in 0.5 x SSC for 5 minutes, at room temperature (25 ml 20 x SSC + 975 ml SQ H 2 0). After 
deproteination in 0.5 ug/ml proteinase K for 10 minutes at 37°C (12.5uL of 10 mg/ml stock in 250 mi 
prewarmcd RNase-free RNAse buffer), the sections were washed in 0.5 x SSC for 10 minutes at room 
temperature. The sections were dehydrated in 70%. 95%, 100% ethanoL 2 minutes each. 

Pretreatmem of paraffin -embedded sections The slides were deparaffinized, placed in SQH 2 0, and 
rinsed twice in 2 x SSC at room temperature, for 5 minutes each time. The sections were deproteinated in 20 
ug/ml proteinase K (500 uL of 10 mg/ml in 250 mi RNase-free RNase buffer; 37C, 15 minutes ) - human 
embryo, or 8 x proteinase K. (100 uL in 250 ml Rnase buffer, 37°C, 30 minutes) - formalin tissues. 
Subsequent rinsing in 0.5 x SSC and dehydration were performed as described above. 

Prehybridization The slides were laid out in plastic box lined with Box buffer (4 x SSC, 50 rf /o 
formamide) - saturated filter paper. The tissue was covered with 50 uL of hybridization buffer (3.75g Dextran 
Sulfate + 6 ml SQ H 2 0), vortexed and heated in the microwave for 2 minutes with the cap loosened. After 
cooling on ice, 18.75 mi formamide, 3.75 mi 20 x SSC and 9 ml SQ H 2 0 were added, the tissue was vortexed 
well, and incubated at 42°C for 1-4 hours. 

Hybridization 1.0 x 10 6 cp. probe and 1.0 uL RNA (50 mg/ml stock) per slide were heated at 
95°C for 3 minutes. The slides were cooled on ice, and 48 uL hybridization buffer were added per slide. After 

vortexing, 50 uL 33 P mix were added to 50 uL prehybridization on slide. The slides were incubated overnight 
at55C. 
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Washes Washing was done 2x10 minutes with 2xSSC, EDTA at room temperature (400 ml 20 x SSC 
+ 16 ml 0.25M EDTA, Vf«4L), followed by RNaseA treatment at 37°C for 30 minutes (500 uL of 10 mg/ml in 
250 ml Rnase buffer - 20 ug/mi), The slides were washed 2x10 minutes with 2 x SSC, EDTA at room 
temperature. The stringency wash conditions were as follows: 2 hours at 55°C 0.1 x SSC, EDTA (20 ml 20 x 
5 SSC + 16 ml EDTA, Vf=4L). 

Alternatively, multi-tissue blots containing poly A 4 * UNA (2 fig per lane) from various human tissues 

were purchased from Clontech (Palo Alto, CA). DNA probes were labeled with [cc- 32 P]dCTP by random 
priming DNA labeling Beads (Pharmacia Biotech). Hybridization was performed with Expresshyb (Clontech) 
at 68°C for 1 nr. The blots were then washed with 2X SSC/0.05% SDS solution at room temperature for 40 
10 min, followed by washes in 0. IX SSC/0. 1%SDS solution at 55°C for 40 min with one change of fresh solution. 
The blots were exposed in a phosphorimager. 

DNA 29101 (VEGFB9) 

DNA29101 (SEQ ID NO:l) was examine in three separate in xittt studies wherein the following probes were 
15 used: 

VEGFB9-pI (SEQ ID NO: 194): 

S'-GGATTCTAATACGACTCACTATAGGGCGGCGGAATCCAACCTGAGTAGO' 
VEGFB9-p2 (SEQ ID NO: 195): 
j 5'-CTA TG A AAT TAA CCC TCA CTA AAG GG A GCG GCT ATC CTC CTG TGC TC-3' 

20 

IS97-029: 

Expression observed in the developing lower fetal limb bones at the edge of the cartiiagenous anlage 
(Le„ around the outside edge); in developing tendons, in vascular smooth muscle and in cells embracing 
developing skeletal muscle myocytes and myotubes. Expression also observed in the following tissues: 
25 epiphyseal growth plate: lymph nodes - marginal sinus: thymus - subcapsular region of the thymic cortex, 
possibly representing either the subcapsular epithelial cells or the 

proliferating, double negative, thymocytes that are found in this region; tracheal smooth muscle; brain 
(cerebral cortex) - focal expression in cortical neurones; small intestine - smooth muscle; thyroid - thyroid 
epithelium; liver - ductal plates; stomach - mural smooth muscle: fetal skin - basal layer of squamous 
30 epithelium: placenta - interstitial ceils in trophoblastic villi: spinal cord - no expression except in wall of 
arteries and veins. No expression was observed in the spleen and adrenals. 

The above expression pattern suggests that DNA29101 may be involved in ceil differentiation 
/proliferation. 

35 IS97-037: 

Expression in superovulated rat ovaries were negative in ail sections with both antisense and sense 
probes. Either the message is not expressed in this model, or the human probe does not cross react with rat. 

IS97-087: 
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High expression levers were observed at the following sites: chimp ovary - granulosa cells of 
maturing follicles, lower intensity signal observed over thecal cells; chimp parathyroid - high expression over 
chief cells; human fetal testis - moderate expression over stromal cells surrounding developing tubules: human 
fetal lung - high expression over chondrocytes in developing bronchial tree, and low level expression over 
branching bronchial epithelium. 

Fetal tissues examined (E12-E16 weeks) include: placenta, umbilical cord, liver, kidney, adrenals, 
thyroid, lungs, heart, great vessels, esophagus, stomach, small intestine, spleen, thymus, pancreas, brain, eye, 
spinal cord, body wail, pelvis and lower limb. Adult tissues examined include liver, kidney, adrenal, 
myocardium, aorta, spleen, lymph node, pancreas, lung, skin, cerebral cortex (rm), hippocampus(rm), 
.cerebellum(rm), penis, eye, bladder, stomach, gastric carcinoma, colon, colonic carcinoma and 
chondrosarcoma. Also examined were acetaminophen induced liver injury and hepatic cirrhosis 



DNA3Q871 : 

IS97-044: In fetal tissues, strong signals were observed over neurones in fetal cerebral cortex, spinal cord, 
spinal ganglia as well as enteric neurones in the wall of the fetal stomach. Signal also observed over cells 
around the root of the aorta (possibly the conducting system), adrenal medulla, mesenchymal cells in 
neurovascular bundle, renal parenchyma and cells lying between skeletal muscle myocytes. All other fetal 
tissues negative. 

No expression was observed in adult tissue. Fetal tissues (12-16 weeks) examined include: placenta, 
umbilical cord, liver, kidney, adrenals, thyroid, lungs, heart, great vessels, esophagus, stomach, small intestine, 
spleen, thymus, pancreas, brain, eye, spinal cord, body wail, pelvis and lower limb. Adult tissues examined 
include: liver, kidney, adrenal, myocardium, aorta, spleen. lymph node, pancreas, lung and skin. 
The probes used in the above analysis were the following: 
DNA3087I-pI (SEO ID NO: 196): 

5'-GGA TTC TAA TAC G AC TCA CTA TAG GGC CTC CCG TCT CCT CCT GTC CTC-3' 
DNA30871-p2 (SEO ID NO: 197): 

5'-CTA TGA AAT TAA CCC TCA CTA AAG GGA CCT CGG CAT CTT CGT CAC ATT- 3* 
DNA30942 : 

DNA30942 (SEQ ID NO: 13) was examined in four separate in situ studies (including two in the diseased tissue 
study of Example 7 using the following probes: 
DNA30942-pl (SEQ ID NO: 198) 

5*-GGA TTC TAA TAC GAC TCA CTA TAG GGC TCG CTG CTG TGC CTG GTG TTG-3' 
DNA30942-p2: (SEQ ID NO: 1 99) 

5'-CTA TGA AAT TAA CCC TCA CTA AAG GGA CCG CTG CAG CCT CTT GAT GGA-3' 
IS97-043: 
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No expression was observed in fetai tissues. The fetal tissues examined included: placenta, umbilical 
cord, brain, spinal cord, eye, optic nerve, trachea, lung, heart, thymus, liver, spleen, esophagus, small intestine, 
pancreas, adrenal/thyroid, body wall and lower limb. 

No expression was observed in adult tissues. The adult tissues examined included liver, kidney, adrenal, 
5 myocardium, aorta, spleen, lymph node, pancreas, lung and skin. 

DNA33087 (IS97-Q51): 

In fetal tissue, expression of DNA33087 (SEQ ID NO: 18) was observed in osteoblasts at ail sites of 
enchondral and periosteal new bone formation, the developing pulmonary arterial and aortic trunks. The fetal 
10 tissues examined included: placenta, umbilical cord, brain, spinal cord, eye, optic nerve, trachea, lung, heart 
thymus, liver, spleen, esophagus, small intestine, pancreas, adrenal, thyroid, body wall and lower limb. 

No expression was observed in the adult tissues examined including: Liver, kidney, adrenal, 
myocardium, aorta, spleen, lymph node, pancreas, lung and skin. 
The probable role in control of bone matrix deposition and or osteoblast growth. 
1 5 All adult tissues in the muitiblock were positive for beta-actin. 

The probes used in this procedure were the following: 

DNA33087-pl (SEQ ID NO:200): 

5'-GG A TTC TAA TAC GAC TCA CTA TAG GGC CCC GAG TGT TTT CCA AGA-3' 
20 DNA33087-p2 (SEQ IDNO:201): 

5'-CTA TGA AAT TAA CCC TCA CTA AAG GG A CAA GTT TAC TAG CCC ATC CAT-3' 
DNA33087-p3 (SEQ ID NO:202): 

5*-GGA TTC TAA TAC GAC TCA CTA TAG GGC TGG ATG GGC TAG TAA ACT TGA- 3' 
DNA33087-p4 (SEQ ID NO:203): 
25 5'-CTA TGA AAT TAA CCC TCA CTA AAG GGA CCC TTC TGC TCC TTC TTG TT-3' 

DNA34387 (IS97-109): 

The expression pattern of DNA34387 (SEQ ID NO:25) was observed in fetai and adult human tissues at the 
following sites: 

30 Fetal - thyroid epithelium^ small intestinal epithelium, gonad, pancreatic epithelium, hepatocytes in liver and 
renal tubules. Expression also seen in vascular tissue in developing long bones. 

Adult - Moderate signal in placental cytotrophoblast, renal tubular epithelium, bladder epithelium, parathyroid 
and epithelial tumors. 

The fetal (EI2-E16 weeks) tissues examined included: placenta, umbilical cord, liver, kidney, adrenals, 
35 thyroid, lungs, heart, great vessels, esophagus, stomach, small intestine, spleen, thymus, pancreas, brain, eye, 
spinal cord, body wah\ pelvis and lower limb. 

The adult human tissues examined: kidney (normal and end-stage), adrenal, myocardium, aorta, spleen, lymph 
node, gall bladder, pancreas, lung, skin, eye (inc. retina), prostate, bladder, liver (normal, cirrhotic, acute 
failure). 

40 The non-human primate tissues examined included the following: 
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Chimp tissues: Salivary gland, stomach, thyroid, parathyroid, skin, thymus, ovary, lymph node. 
Rhesus Monkey Tissues: Cerebral cortex, hippocampus, cerebellum, penis. 

The probes used in this procedure were the following: 
DNA34387.pl (SEQ ID NO:206): 
5 5'-GGA TTC TAA TAC GAC TCA CTA TAG GGC CCG AGA TAT GCA CCC AAT GTC-3' 
DNA34387-p2 (SEQ ID NO:207): 

5'-CTA TGA AAT TAA CCC TCA CTA AAG GG A TCC CAG AAT CCC G AA GAA CA-3' 

DNA35638 : 
10 IS97-078: 

Expression of DNA35638 (SEQ ID NO:35) was observed in the endothelium lining a subset of fetal and 
placental vessels. Endothelial expression was confined to these tissue blocks. Expression also observed over 
intermediate trophoblast cells of placenta. 

The fetal tissues examined (E12-E16 weeks) included: placenta, umbilical cord, liver, kidney, 
15 adrenals, thyroid, lungs, hean. great vessels, esophagus, stomach, small intestine, spleen, thymus, pancreas, 
brain, eye. spinal cord, body wall, pelvis and lower limb. 

The adult tissues examined included: liver, kidney, adrenal, myocardium, aorta, spleen, lymph node, pancreas, 
lung/ skin, cerebral cortex (rm), hippocampus(rm). cerebellum(rm), penis, eye, bladder, stomach, gastric 
carcinoma, colon, colonic carcinoma, thyroid (chimp), parathyroid (chimp) 
20 ovary (chimp) and chondrosarcoma. Also examined was tissue derived from acetaminophen induced liver 
injury and hepatic cirrhosis. 

The oligos used for the above procedure were the following: 
DNA35638.pl (SEQ ID NO:208): 

5'-GGA TTC TAA TAC GAC TCA CTA TAG GGC GGG AAG ATG GCG AGG AGG AG-3* J 
25 DNA35638-p2 (SEQ ID NO:209): 

5'-CTA TGA AAT TAA CCC TCA CTA AAG GGA CCA AGG CCA CAA ACG GAA ATC-3' 

DNA39523 : 

The following probes were used in the in situ studies below: 
30 DNA39523-pI (SEQ ID NO:210): 

5'-GGA TTC TAA TAC GAC TCA CTA TAG GGC AGC GCA CGG CCA CAG ACA-3* 
DNA39523-p2 (SEQ ID NO:2 1 1 ): 

5*-CTA TGA AAT TAA CCC TCA CTA AAG GGA GAC CCT GCG CTT CTC GTT CCA-3' 
35 198-052: 

DNA39523 (SEQ ID NO:45) in normal human skin (neonatal foreskin) and adult psoriatic skin both 
exhibited specific strong expression in the epithelial cells of the stratum basale - the single layer along the 
basement membrane which is the progenitor for all of the overlying epidermal cells in the skin. 

There was no expression in epidermal cells in the overlying layers (stratum spinosum, straum 
40 granulosum, etc.). The intensity of the signal was slightly increased in psoriatic skin. Expression was also 
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apparent in the dermis (the connective tissue immediately underlying . the epidermis) of both normal and 
psoriatic skin. Expression here was most apparent in spindle shape cells within the collagen matrix - the 
stromal fibroblasts. 

In the brain, sections of cerebrum had strong specific expression in a subset of superficial cortical 
5 neurons - a distinct pattern suggesrive of a specific population of cortex neurons. 

In inflamed and normal bowel: Normal human large bowel and bowel with either Crohns's disease or 
ulcerative colitis had specific moderate to strong expression in a multifocal pattern within the lamina propria of 
villi. The cells labeled by in situ were spindloid stromal cells best delineated as 

fibroblasts. There was no expression by intestinal epithelial cells and there was no apparent increased 

10 expression (intensity or frequency) in diseased bowel. Specifically there was also no correlation of expression 
and lesions in the inflamed bowel. 

In human fetal kidney, there was specific weak to moderate expression in multifocal developing 
tubules; expression was in the tubular epithelium in these foci. 

The expression of DNA39523 (SEQ ID NO:45) in the skin and specific localization to the basal 

15 epithelial cells of the epidermis cells suggests a potential role in differentiation/ maintenance of the basal 
epidermal cells. This expression pattern in combination with the fact that expression occurs in ceils that are 
directly adjacent to the basement lamina, suggests that the cells regulate trafficking of leukocytes into the 
epidermis. As a result DNA39523 (SEQ ID NO:45) may be a constitutive ly expressed signal for the 
trafficking of dendritic/ Langerhan cells or lymphocytes into the epidermis. Such trafficking is a normal 

20 physiologic event that occurs in normal skin and is thought to be involved in immunosurveillance of the skin. 

The expression of DNA39523 (SEQ ID NO:45) in inflammatory bowel disease was not increased 
from normal tissue, and there was no correlation of its expression to inflammatory lesions. Similarly, its 
expression in the basal epidermal cells in psoriatic skin lesions was equivalent to or only slightly greater than 
that seen in normal neonatal skin (but age-matched control adult skin was not available at the time of the 

25 study). 

IS97-I28: 

The expression of DNA39523 (SEQ ID NO:45) was observed in the epithelium of mouse embryo skin 
as well as the basal epithelium and dermis of human fetal skin. The basal epithelial pegs of the squamous 

30 mucosa of the chimp tongue are also positive. Expression was also observed in a subset of cells in developing 
glomeruli of fetal kidney, adult renal tubules, and over "thyroidized" epithelium in end-stage renal disease. 
However, low expression was also seen in a renal cell carcinoma, probably over the epithelial cells. 
Expression was also observed in the stromal cells both (1) at low levels in fetal lung, and (2) in the apical 
portion of gastric glands. High expression was indicated in the lamina propria of the fetal small intestinal villi, 

35 normal colonic mucosa and over stromal cells in a colonic carcinoma. Strong expression occurred in benign 
connective tissue cells in the hylanized stroma of a sarcoma. Expression also occurred in stromal cells in the 
placental villi and the splenic red pulp. In the brain, expression occurred in cortical neurones. 

DNA39523 (SEQ ID NO:45) was also expressed in the connective tissue surrounding developing 
bones and over nerve sheath cells in the fetus. 

140 



SUBSTITUTE SHEET (RULE 26) 



WO 00/53758 



PCT/USOO/05841 



The fetal tissues examined (E12-E16 weeks) included: placenta, umbilical cord, liver, kidney, 
adrenals, thyroid, lungs, heart, great vessels, esophagus, stomach, small intestine, spleen, thymus, pancreas, 
brain, eye, spinal cord, body wall, pelvis and lower limb. The adult tissues examined included: liver, kidney, 
adrenal, myocardium, aorta, spleen, lymph node, pancreas, lung, skin, cerebral cortex (rm), hippocampus(rm), 
5 eye, stomach, gastric carcinoma, colon, colonic carcinoma, thyroid (chimp), parathyroid (chimp) ovary (chimp) 
and chondrosarcoma. Also examined included acetaminophen induced liver injury and hepatic cirrhosis. 

IS98492: 

The expression of DNA39523 (SEQ ID NO:45) was present in many ceils in the outer layers (I and II) 
10 of the monkey cerebral cortex. A small subset of cells in the deeper cortical layers also expressed mRNA for 
the chemokine homolog. Scattered cells within the molecular layers of the hippocampus and bordering the 
inner edge of the dentate gyrus showed expression of DNA39523 (SEQ ID NO:45). No expression was 
detected within the cerebellar cortex. Expression of DNA39523 (SEQ ID NO:45) was not observed in 
infarcted brain, where cell death has occurred- in the regions where the chemokine homolog normally is 
15 expressed. DNA39523 (SEQ ID NO:45) could possibly serve as a marker of a subset of neurons of outer 
layers of the cerebral concx and could possibly reveal neuronal migration disorders. Abnormal neuronal 
migration is a possible cause of some seizure disorders and schizophrenia. 



20 

1598- I28: 

DNA39523 (SEQ ID NO:45) showed intriguing and specific patterns of hybridization within postnatal 
day (P) 10 and adult mouse brains. In one sagittal section of PIO mouse brain, strong signal was observed 
scattered within the molecular layer of the hippocampus and inner edges of the dentate gyrus. Cells in the 

.25 presubiculum were moderately labeled; the signal extended in a strong band through outer layers of the 
retrosplenial cones to the occipital cortex, where the signal diminished to background levels. A small set of 
positive neurons were detected in deeper regions of P10 motor cortex: neurons in outer layers of P10 cortex did 
not exhibit signal above background levels. Moderate hybridization signal was also detected in the inferior 
colliculus. Chemokine homolog signal in the adult mouse brain was evaluated in three coronal sections at 

30 different levels. Strong signal was detected in the septum and in scattered neurons in the pontine nuclei and 
motor root of the trigeminal nerve; moderate signal was seen in the molecular layers of the hippocampus and 
outer layers of the retrosplenial cortex. 

r 

1599- 027: 

35 Bolekine (also known as BRAK - the chemokine to which DNA39523 (SEQ ID NO:45) bear 

significant homology) belongs to a chemokine subgroup characterized by a cys-x-cys (CXC) motif and absence 
of an ammo-terminal glu-Ieu-arg (ELR). Non-ELR CXC chemokines (includingSDF-l, IP10, Mig and PF4) are 
chemotactic for subsets of leukocytes including B and T lymphocytes. They also have angiostatic activity. 

DNA39523 (SEQ ID NO:45) was detected in Postnatal day (P) 1 mouse brain, bolekine signal was 

40 detected in the hippocampus (stratum lacunosum moleculare and hilus of the dentate gyrus) and anterior, 
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olfactory nucleus, but not in the developing cerebral cortex or cerebellum. By PIO, signal is present in a subset 
of cells in layers 1 & 2 of the cerebral cortex. A small population of cells in the deeper layers also express 
DNA39523 (SEQ IDNO:45). The pattern in the hippocampus resembled the PI brain. Weak signal is present in 
the cerebellum, especially lobules IX and X. Signal is also present in the dorsal striatum and colliculi. 
5 In the adult mouse brain, bolekine-positive cells were difficult to detect in the adult cerebral cortex, 

but signal is present in the anterior olfactory nucleus and hippocampus. In ischemic mouse brains, however, 
bolekine signal is induced in the penumbra. 

In the developing cerebral cortex, bolekine expression correlates with final stages of neuronal 
migration and the establishment of axonal projections and synaptogenesis. Other CXC chemokines have roles 
10 in neuronal migration and patterning in the central nervous system (SDF-1) and modulation of neuronal 
activity (IL-8 and GRO-a). 

Bolekine expression is induced in ischemic -reperfusion injury in the brain, but not in other 
inflammatory states. 

15 DNA47365 /IS97-142): In fetal tissues, the expression of DNA47635 (SEQ ID NO;91) was observed in the 

fascia lining the anterior surface of the vertebral body. There is expression over the fetal retina. Low level 

expression over fetal neurones. 

The following probes were used in the above analysis: 

DNA47365-pl (SEQ ID NO:214): 
20 5'-GGA TTC TAA TAC G AC TCA CTA TAG GGC AAC CCG AGC ATG GCA CAG CAC-3* 

DNA47365-p2 (SEQ ID NO:215): 

5'-CTA TG A AAT TAA CCC TCA CTA AAG GG A TCT CCC AGC CGC CCC TTC TC-3' 
DNA49435 (IS97-136): 

25 Moderate expression of DNA49435 (SEQ ID NO:l i I) was observed over conical neurones in the 

fetal brain. Expression was also present over the inner aspect of the fetal retina, possible expression in the 
developing lens. Expression was seen over fetal skin, cartilage, small intestine, placental villi and umbilical 
cord. In adult tissues there is an extremely high level of expression over the gallbladder epithelium. Moderate 
expression of DNA49435 (SEEQ ID NO: 1 1 1) was seen over the adult kidney, gastric and colonic epithelia. 

30 The human fetal tissues examined (E12-E16 weeks) included: placenta, umbilical cord, liver, kidney, 

adrenals, thyroid, lungs, heart, great vessels, esophagus, stomach, small intestine, spleen, thymus, pancreas, 
brain, eye, spinal cord, body wail, pelvis, testis and lower limb. The adult human tissues examined included: 
kidney (normal and end-stage), adrenal, spleen, lymph node, pancreas, lung, eye (inc. retina), bladder, liver 

(normal, cirrhotic, acute failure). 

o 

35 The non-human primate tissues examined included the adrenal glands from chimp tissues and the 

cerebral cortex, hippocampus and cerebellum of rhesus monkey tissues. 

The probes used in the above analysis were the following: 
DNA49435-pl (SEQ ID NO:218): 

S-GGA TTC TAA TAC GAC TCA CTA TAG GGC GGA TCC TGG CCG GCC TCT G-3' 
40 DNA49435-p2 (SEQ ID NO:219): 
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5'-CTA TG A AAT TAA CCC TCA CTA AAG GG A GCC CGG GCA TGG TCT CAG TTA-3* 
DNA54228 (IS98-105): 

Expression of DNA54228 (SEQ ID NO: 133) was observed in bone spicules: fetal metaphyseal bone, 
5 fetal calvarium (skull) and bone tissue in human neoplasia (osteosarcoma and chondrosarcoma). There is weak 
but consistent signal in small bone spicules in the metaphysis of fetal bone and in ossified spicules in a 
chondrosarcoma and an osteosarcoma. No signal was detected in human lung, liver, thymus, kidney, thyroid, 
brain, spleen, fetal tissues including adrenal, brain, cartilage, lung, liver, intestine, gonad, heart and skin. 
The probes used in the above procedure were the following: 
1 0 hmDETI-p I (SEQ ID NO:220): 

5'-GGA TTC TAA TAC GAC TCA CTA TAG GGC ACC ACC ACC CAG GAG C-3' 
hmDETI-p2 (SEQ IDNO:221): 

5'-CTA TG A AAT TAA CCC TCA CTA AAG GGA AAT G AA GTG GGA CGT TTG AGT-3* 
DNA54228-pl (SEQ ID NO:222): 
1 5 5'-GG A TTC TAA TAC GAC TCA CTA TAG GGC CTT CTT TCC TTC ACC ACC ACC-3' 
DNA54228-p2 (SEQ ID NO:223): 

5*-CTA TGA AAT TAA CCC TCA CTA AAG GGA TCT GCC TTG GCT TTT GAC AC-3* 

DNA54231 (mFIZZ3) : 
20 IS98-070: 

DNA5423! (SEQ ID NO: 139) shewed a moderate signal that is specific to adipocytes. This signal 
was present in mesenteric fat and in interstitial fat in the neck around the trachea. The expression pattern 
appears to be specific for adult fat. 

25 IS98-I09: 

The expression of DNA5423 1 (SEQ ID NO: 139) was specific to adipocytes and was present wherever 
such cells were found which in this study included the peritoneal mesentery, perirenal fat in the renal pelvis, 
and the mammary fat pad. There was no expression in any other cell type in normal murine brain, liver, 
kidney, mammary gland, pancreas, spleen, pancreas, bone marrow, stomach, duodenum, jejunum, ileum, colon, 
30 cecum, testis, skin, or lung. 

The selective distribution of this molecule to adipocytes suggests a role in either fat metabolism or the 
production/genesis of adipocytes, either of which is important in obesity. 

The probes used for the above procedure were the following: 
DNA5423 1 -p 1 (SEQ ID NO:224): 
35 5'-GGA TTC TAA TAC GAC TCA CTA TAG GGC CGA GGG GGA CAG GAG CTA ATA-3' 
DNA5423I-p2 (SEQ ID NO:225): 

5*-CTA TGA AAT TAA CCC TCA CTA AAG GGA GTC CCA CGA GCC ACA GG-3' 
DNAS9294 (IS98-138): 
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DNA 59294 (SEQ ID NO: 149) was evaluated in a panel consisting of normal adult and tetal tissues 
and tissues with inflammation, predominantly chronic lymphocytic inflammation. In summary, the expression 
was specific to muscle, certain types of smooth muscle in the adult and in skeletal and smooth muscle in the 
human fetus. The expression in adult human was in smooth muscle of tubular organs evaluated including 
5 colon and gall bladder. There was no expression in the smooth muscle of vessels or bronchi. No adult human 
skeletal muscle was evaluated. In fetal tissues there was moderate to high diffuse expression in skeletal muscle 
the axial skeleton and limbs. There was weak expression in the smooth muscle of the intestinal wall but no 
expression in cardiac muscle. 

In adult tissues, the colon showed a low level of diffuse expression in the smooth muscle (tunica 
10 muscularis) in 5 specimens with chronic inflammatory bowel disease. In the gall bladder, there was weak to 
low level expression in the smooth muscle of the gall bladder. 

In fetal human tissues, there was moderate diffuse expression in skeletal muscle and weak to low 
expression in smooth muscle. However expression was not detected in the fetal heart or any other fetal organ 
including liver, spleen. CNS. kidney, gut. lung. 
15 The additional human tissues tested with no detectable expression included: lung with chronic 

granulomatous inflammation and chronic bronchitis (5 patients), peripheral nerve, prostate, heart, placenta, 
liver (disease multiblock including acetomihopin induced injury and cirrhosis), brain (cerebrum and 
cerebellum), tonsil (reactive hyperplasia), peripheral lymph node, thymus. 

The probes used in the above procedure were the following: 
20 626.pl (SEQ ID NO:226): 

5'-GGA TTC TAA TAG GAG TCA CTA TAG GGC CGG AAT GGA CTG GCC TCA CAA-3' 
626.p2 (SEQ ID NO:227): 

5'-CTA TGA AAT TAA CCC TCA CTA AAG GGA AGG ATG GTC TCG GGC TGC TG-3' 

25 DNA30868 (IS9 7-044) 

DNA30868 expression was found in the following fetal tissues: spinal cord, autonomic ganglia, 
enteric nerves, sacral plexus, peripheral and cranial nerves. 

The fetal tissues examined were the following: Placenta, umbilical cord, liver, kidney, adrenals, thyroid, lungs, 
heart, great vessels, esophagus, stomach, small intestine, spleen, thymus, pancreas, brain, eye, spinal cord, 
30 body wall, pelvis and lower limb. 

The adult tissues examined included: Liver, kidney, adrenal, myocardium, aorta, spleen, lymph node, pancreas, 
lung and skin. 

The probes used for the above procedure were the following: 
DNA30868.pl (Clll-G): (SEQ ID NO:304) 

35 5'-GGA TTC TAA TAC G AC TCA CTA TAG GGC AG A GAC AGG GCA AGC AGA ATG-3* 
DNA30868.p2 (Clll-H): (SEQ ID NO:305) 

5'-CTA TGA AAT TAA CCC TCA CTA AAG GGA G AA GGG GAT GAC TGG AGG AAC-3 1 

DNA53517 : 
40 IS98-070: 
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DNA53517 (SEQ ID NO:255) expression in the normal adult murine iung was patchy, with expression in a 
subset of mucosal epithelial cell in the large airway (bronchi/bronchioles). There is also expression within the 
rare discrete cells in the submucosal interstitium adjacent to the large airways. These cells, typically 1 -3 within 
a positive focus, are adjacent to large vessels and may represent smooth muscle ceils, peripheral nerves or 
5 Schwann ceils, or lymphatics. 

In the murine adult lung with allergic inflammation (eosinophilic, lymphocytic vasculitis, 
bronchiolitis and pneumonitis), there was diffuse strong expression in all mucosal epithelial cells of all of the 
large airways (bronchi/bronchioles) of the lung. There was also strong expression in discrete cells that 
represent a subset of epithelial cells that line the alveoli: these cells are type II pneumocytes. There is also 
10 expression, as in normal lung, present within rare discrete cells in the submucosal interstitium adjacent to the 
large airways. 

In normal adult murine small and large intestine, there is strong expression within multifocal few 
discrete single cells that are present in the submucosa. the tunica muscularis and the mesentery. The cells that 
express the signal are almost always associated with nerve, vein, artery triads within these areas. "These cells 
15 are spindle shaped and may be cither a peripheral nerves. Schwann cells 

associated with such nerves or some type of support cell associated with vessel or lymphatics. Interestingly, 
there is no expression within identifiable myenteric plexi that are present within the tunica muscularis. 

In inflamed large bowel (from an IL10R KO mouse) the pattern of expression is similar but 
expression level is significantly decreased. 

20 

IS98-093: 

The distribution of DNA53715 (SEQ ID NO:255) was further evaluated in a broad screen of normal 
murine tissues. In normal lung, expression is variable but when present was restricted to murine bronchial 
epithelial cells and type II alveolar cells in the lung. There is a marked increase in expression in these cells in 

25 inflamed lung (allergic inflammation with bronchial mucosal hypertrophy/hyperplasia; asthma model). The 
expression of DNA53715 (SEQ ID NO:255) in the bowel is most prominent in the colon and is present in few 
discrete cells within the submucosa and mucosa muscularis. the thin, well vascularized tissue layer between the 
muscle wall of the bowel and the mucosa proper. The exact identity of these cells has not been delineated, 
however, their spindloid morphology and close association to capillaries and small vessels in the submucosa 

30 suggest the following possibilities: a subset of vascular pericytes or non-myelinated nerve fibers. 

The expression of DNA53715 (SEQ ID NO:255) in discrete cells in the bowel submucosa was 
restricted to the colon and was not seen in sections of jejunum, ileum, proximal duodenum or stomach. 
Expression was not detected in the following normal murine tissues: liver, kidney, spleen, bone marrow, lung, 
pancreas, stomach, proximal duodenum, jejunum, ileum, brain, skin, testis, or mammary glands. 

35 It is possible that DNA53715 (SEQ ID NO:255) has a role in enhancing or stimulating mucosal 

immunity in the lung. 

The probes used for the above procedure were the following: 

DNA53517.pl (C301-P): (SEQ ID NO:308) 

5'-GGA TTC TAA TAC GAC TCA CTA TAG GGC CCC AGG ATG CCA ACT TTG A-3' 
40 DNA535 1 7.p2 (C30 1-Q): (SEQ ID NO:309) 
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5'-CTA TGA AAT TAA CCC TCA CTA AAG GGA AGG AGG CCC ATC TGT TCA TAG-3' 
EXAMPLE 7 

In situ Hybridization in Ceils and Diseased Tissues 
5 The in situ hybridization method of Example 6 is used to determine gene expression, analyze the 

tissue distribution of transcription, and follow changes in specific mRNA synthesis for the genes/DNAs and the 
proteins of the invention in diseased tissues isolated from human individuals suffering from a specific disease. 
These results show more specifically where in diseased tissues the genes of the invention are expressed and are 
more predictive of the particular localization of the therapeutic effect of the inhibitory or stimulatory 
10 compounds of the invention (and agonists or antagonists thereof) in a disease. Hybridization is performed 
according to the method of Example 6 using one or more of the following tissue and cell samples: 

(a) lymphocytes and antigen presenting cells (dendritic cells, Langherhans cells, macrophages and 
monocytes. NK cells); 

(b) lymphoid tissues: normal and reactive lymph node, thymus. Bronchial Associated Lymphoid 
1 5 Tissues. (BALT). Mucosal Associated Lymphoid Tissues (MALT); 

(c) human disease tissues: 

o Synovium and joint of patients with Arthritis and Degenerative Joint Disease; 
o Colon from patients with Inflammatory Bowel Disease including Ulcerative Colitis and 
Crohns* disease: 

20 o Skin lesions from Psoriasis and other forms of dermatitis; 

o Lung tissue including BALT and tissue lymph nodes from chronic and acute bronchitis, 
pneumonia, pneumonitis, pleuritis: 

o Lung tissue including BALT and tissue lymph nodes from Asthma; 

o nasal and sinus tissue from patients with rhinitis or sinusitis; 
25 ° Brain and Spinal cord from Multiple Sclerosis. Alzheimer's Disease and Stroke: 

o Kidney from Nephritis. Glomerulonephritis and Systemic Lupus Erythcmatosis; 

o Liver from Infectious and non-infectious Hepatitis and acetaminophen-induced liver cirrhosis; 

o Tissues from Neoplasms/Cancer. 

30 Expression is observed in one or more cell or tissue samples indicating localization of the therapeutic 

effect of the compounds of the invention (and agonists or antagonists thereof) in the disease associated with the 
cell or tissue sample. 

The sequences of the oligonucleotides used, where expression overlaps with the non-diseased tissue 
distribution reported earlier is recited in Example 6. 

35 

DNA30942 : 

IS98-02I: Expression was observed in mononuclear phagocytes in the normal chimp thymus, as well as in a 
gastric carcinoma (l/l) colorectal cancer (1/1), breast cancer (2/5) and a lung cancer (1/4). Expressed by 
malignant cells in an osteosarcoma and a poorly differentiated liposarcoma. Possible signal in the malignant 
40 cells of a testicular teratoma and breast cancers ( 1/5). In one of the lung cancers scattered signal is seen over a 
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high endothelial venule within pulmonary lymphoid tissue. The fetai tissues examined (E12-EI6 weeks) 
included: placenta, umbilical cord, liver, kidney, adrenals, thyroid, lungs, heart, great vessels, esophagus, 
stomach, small intestine, spleen, thymus, pancreas, brain, eye, spinal cord, body wail, pelvis and lower limb. 
The adult human tissues examined included: liver, kidney, adrenal, myocardium, aorta, spleen, lung, skin, 
chondrosarcoma, eye, stomach, gastric carcinoma, colon, colonic carcinoma, renal cell carcinoma, prostate, 
bladder mucosa and gall bladder. Also examined was tissue derived from acetaminophen induced liver injury 
and hepatic cirrhosis. The rhesus tissues examined include: cerebral cortex (rm), hippocampus(rm). The 
chimp tissues examined included: thyroid, parathyroid, ovary, nerve, tongue, thymus, adrenal, gastric mucosa 
and salivary gland. 

IS98-Q85: Expression was observed in eight adenocarcinomas and seven squamous lung carcinomas. Actins 

were strongly positive in all tumors, indicating that all are suitable for in situ hybridization analysis. 

Expression of DNA30942 was observed in 6 of the tumors as follows: 

6727-95 / squamous carcinoma - Strongly expressed over neoplastic epithelium; 

9558-95 / squamous carcinoma - Expression over neoplastic epithelium; 

1 2235-95 / adenocarcinoma - Expression over in situ and infiltrating tumor cells; 

6545-95 & 4187-96 / squamous carcinomas - Expression over cells in tumor stroma, no expression seen over 
rumor ceils; 

12954-94 / squamous carcinoma - possible weak expression over stromal cells. 
/S99-//2. 

The in situ expression of DNA30942 (SEQ ID NO:13) was evaluated numerous chronic inflammatory 
conditions and lymphoid organs. In summary, DNA30942 (SEQ ID NO: 13) was strongly expressed in high 
endothelial venules (HEV) in the tonsil, hilar lymph node, bronchial mucosal-associated lymphoid tissue 
(BALT) in chronic asthma, patchy expression in colonic mucosa and weak inconsistent expression in gut- 
mucosal associated lymphoid tissues (GALT) HEV. 

In lymphoid tissues, there was observed strong specific expression in single sections of tonsil, hilar 
lymph node, bronchial mucosal-associated lymphoid tissue BALT) in a case of chronic asthma, and in gut 
mucosal associated lymphoid tissues in sections of IBD (GALT/MALT). In each of these lymphoid organs 
expression specifically was present in high-endothelial venules (HEV), 

In tissue in a chronic asthmatic lung, additionally to expression in BALT HEVs, specific expression 
was observed in small capillaries lined with high or reactive swollen endothelial cells in the submucosa of 
inflamed bronchi. This region was not intimately associated with BALT but was specific to the submucosal 
site for inflammatory cell trafficking to the bronchi. There was a significant submucosal infiltrate of 
eosinophils in these areas. In other sections of diseased lung (COPD and chronic interstitial pneumonia) there 
was not any expression of DNA30942 (SEQ ID NO: 13), these sections had some artifact (loss of tissue from 
slide). 

In psoriatic tissue, there was weak expression in some small dermal capillaries in psoriatic plaques. In 
tonsilar tissue, additional to expression in HEVs associated with follicles, there was also strong expression 
within the reticulated tonsillar crypt epithelium. Expression here was also in vessels in the smairintra-epithelial 
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capillaries present Expression was also within some of the epithelial ceils. This is an important 
immunological site and is involved with antigen presentation and may play a role in tolerance induction. 

In tissue isolated from patients suffering from Crohns' Disease and ulcerative colitis, coionai 
expression was present in the mucosa with patchy distribution in some but not in all cases. Expression in HEV 
5 in GALT was present as a significantly weaker signal than seen in other lymphoid tissues and was not 
consistently present even in sections where there was strong but patchy expression in the mucosa. 

In tissue isolated from acetaminophen induced liver injury and cirrhosis, there was weak expression in 
small capillaries within areas in the portal tracts with chronic lymphocytic inflammation. 

10 DNA33460 (IS98-0I5): 

The expression of DNA33460 (SEQ ID NO:20) was observed over ceils in loose connective tissue 
immediately adjacent to developing extra ocular muscle in the fetal eye. Moderate expression over soft-tissue 
sarcoma. The fetal tissues examined (E12-E16 weeks) included: placenta, umbilical cord, liver, kidney, 
adrenals, thyroid, lungs, heart, great vessels, esophagus, stomach, small intestine, spleen, thymus, pancreas, 

15 brain, eye. spinal cord, body wall, pelvis and lower limb. The adult tissues examined included the liver, 
kidney, renal cell carcinoma, adrenal, aorta, spleen, lymph node, pancreas. lung, myocardium, skin, cerebral 
cortex (rm), hippocampus (rm), cerebellum (rm), bladder, prostate, stomach, gastric carcinoma, colon, colonic 
carcinoma, thyroid (chimp), parathyroid (chimp) ovary (chimp) and chondrosarcoma. Also examined was 
tissue extracted from acetaminophen induced liver injury and hepatic cirrhosis. 

20 The probes used in this procedure were the following: 

DNA33460-p I (SEQ ID NO:204): 

5'-GGA TTC TAA TAC GAC TCA CTA TAG GGC CAG CAC TGC CGG GAT GTC AAC-3* . 
DNA33460-p2 (SEQ ID NO:205): 

5MTTA TGA AAT TAA CCC TCA CTA AAG GGA GTT TGG GCC TCG.GAG CAC TG-3' 

25 

DNA34387 (IS98-083): 

Expression observed in lung cancer tumors and was positive in all eight squamous carcinomas and in 
6/8 adenocarcinomas. Expression levels are low to moderate in the adenocarcinomas and very strong in the 
squamous carcinomas. No expression was seen in the tumor stroma, alveoli or normal respiratory epithelium. 
30 Possible low level expression in the lymph nodes. 

Expression was observed in lung cancer. The gene was amplified in Taqman analysis of a lung tumor 
panel. Expression was observed in eight squamous carcinomas and in 6/8 adenocarcinomas. Expression was 
seen in in situ and infiltrating components. Expression levels were low to moderate in the adenocarcinomas. In 
general expression was higher in the squamous carcinomas and in two the expression was strong. Possible low 
35 level expression in lymph nodes. 

DNA35638 : 
1S98-124: 

This study examined the expression of DNA35638 (SEQ ID NO:35) in inflamed human tissues 
40 (psoriasis, IBD, inflamed kidney, inflamed lung, hepatitis (liver block), normal tonsil, adult and chimp 
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multiblocks) DHA35638 (SEQ ID NO:35) has been shown elsewhere in this application to have 
immunostimuiatory (enhances T lymphocyte proliferation in the MLR and costimulation) and proinflammatory 
properties (induces a neutrophil infiltrate in vivo). 

This study evaluated the differential expression of this molecule in vessels of inflamed human tissues 
5 as compared to non- inflamed tissues. In summary, expression was present in the endothelium/ intima of large 
vessels in the lung afflicted with chronic inflammation, in the superficial dermal vessels of the psoriatic skirt, in 
arterioles in a specimen of chronic sclerosing nephritis, and in capillaries including the perifollucular sinuses of 
tonsil. DNA35638 (SEQ ID NO:35) was not expressed (as detectable by this methodology) in normal skin 
(human foreskin specimens), normal lung, inflamed (8 IBD specimens) or normal large bowel, chronically 
1 0 inflamed or cirrhotic liver, normal adult cardiac tissue, or adrenal gland. 

DNA39523 : 
198-052: 

DNA39523 (SEQ ID NO: 45) in normal human skin (neonatal foreskin) and adult psoriatic skin both 
15 exhibited specific strong expression in the epithelial cells of the stratum basale - the single layer along the 
basement membrane which is the progenitor for all of the overlying epidermal ceils in the skin. 

There was no expression in epidermal cells in the overlying layers (stratum spinosum. straum 
granulosum. etc.). The intensity of the signal was slightly increased in psoriatic skin. Expression was also 
apparent in the dermis (the connective tissue immediately underlying the epidermis) of both normal and 
20 psoriatic skin. Expression here was most apparent in spindle shape cells within the collagen matrix - the 
stromal fibroblasts. 

In inflamed and normal bowel: Normal human large bowel and bowel with either Crohns's disease or 
ulcerative colitis had specific moderate to strong expression in a multifocal pattern within the lamina propria of 
villi. The cells labeled by in situ were spindloid stromal cells best delineated as 

25 fibroblasts. There was no expression by intestinal epithelial ceils and there was no apparent increased 
expression (intensity or frequency) in diseased bowel. Specifically there was also no correlation of expression 
and lesions in the inflamed bowel. 

The expression of DNA39523 (SEQ ID NO:45) in the skin and specific localization to the basal 
epithelial cells of the epidermis ceils suggests a potential role in differentiation/maintenance of the basal 

30 epidermal cells. This expression pattern in combination with the fact that expression occurs in cells that are 
directly adjacent to the basement lamina, suggests that the cells regulate trafficking of leukocytes into the 
epidermis. As a result DNA39523 (SEQ ID NO:45) may be a constitutively expressed signal for the 
trafficking of dendritic/Langerhan cells or lymphocytes into the epidermis. Such trafficking is a normal 
physiologic event that occurs in normal skin and is thought to be involved in irnmunosurveillance of the skin. 

35 The expression of DNA39523 (SEQ ID NO:45) in inflammatory bowel disease was not increased 

from normal tissue, and there was no correlation of its expression to inflammatory lesions. Similarly, its 
expression in the basal epidermal cells in psoriatic skin lesions was equivalent to or only slightly greater than 
that seen in normal neonatal skin (but age-matched control adult skin was not available at the time of the 
study). 

40 
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UWAS4±6(IS98-140>: 

The expression of DNA45416 (SEQ ID NO:79) was evaluated in a variety of human and non-human 
primate tissues and was found to be highly specific. Expression was present only in alveolar macrophages in 
the lung and in Kupffer cells of the hepatic sinusoids. Expression in these ceils was significantly increased 
5 when these distinct cell populations were activated. Though these two subpopulations of tissue macrophages 
are located in different organs, they have similar biological functions. Both types of these phagocytes act as 
biological filters to remove material from the blood stream or airways including pathogens, senescent cells and 
proteins and both are capable of secreting a wide variety of important proinflammatory cytokines. 

In inflamed lung (7 patient samples) expression was prominent in reactive alveolar macrophage cell 

10 populations defined as large, pale often vacuolated cells present singly or in aggregates within alveoli and was 
weak to negative in normal, non-reactive macrophages (single scattered cells of normal size). Expression in 
alveolar macrophages was increased during inflammation when these cells were both increased in numbers and 
size (activated). Despite the presence of histocytes in areas of interstitial inflammation and peribronchial 
lymphoid hyperplasia in these tissues, expression was restricted to alveolar macrophages. Many of the 

15 inflamed lungs also had some decree of suppurative inflammation: expression was not present in neutrophilic 
granulocytes. 

In liver, there was strong expression in reactive/activated Kupffer cells in livers with acute 
centri lobular necrosis (acetaminophen toxicity) or fairly marked periportal inflammation. However there was 
weak or no expression in Kupffer cells in normal liver or in liver with only mild inflammation or mild to 
20 moderate lobular hyperplasia/hypertrophy. Thus, as in the lung, there was increased expression in 
acivated/reactive cells. 

There was no expression of this molecule in histiocytes/macrophages present in inflamed bowel, 
hyperplastic/ reactive tonsil or normal lymph node. The lack of expression in these tissues which all contained 
histiocytic inflammation or resident macrophage populations strongly supports restricted expression to the 
25 unique macrophage subset populations defined as alveolar macrophage and hepatic Kupffer cells. However, 
the expression of DNA454216 (SEQ ID NO:79) spleen or bone marrow was not available for evaluation. 

Human tissues evaluated which had no detectable expression included: Inflammatory Bowel disease 
(7. patient samples with moderate to severe disease), tonsil with reactive hyperplasia, peripheral lymph node, 
psoriatic skin (2 patient samples with mild to moderate disease), heart, peripheral nerve. Chimp tissues 
30 evaluated which had no detectable expression included: tongue, stomach, thymus. 
The probes used for the above studies were the following: 
628.pl (SEQ ID NO:212): 

5'-GGA TTC TAA TAC GAC TCA CTA TAG GGC CTC CAA GCC CAC AGT GAC AA-3* 
628.p2 (SEQ ID NO:213): 
35 5'-CTA TGA AAT TAA CCC TCA CTA AAG GGA CCT CCA CAT TTC CTG CCA GTA-3' 

DNA41374 : 
IS-98-677: 

DNA4 1 374 (SEQ ID NO:248) was expressed in thymic T lymphocytes 
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Summary: In numerous tissues evaluated there expression was only detected as weak diffuse expression in 
thymic T lymphocytes. The limited distribution pattern suggests expression by T lymphocytes or cells closely 
associated with T lymphocytes such as antigen presenting cells (dendritic cell populations, ere). In inflamed 
human tissue with significant lymphocytic inflammation and presence of reactive follicle formation 
5 (inflammatory bowel disease and chronic lymphocytic 

interstitial pneumonia/bronchitis) there was no detectable expression in areas which contained significant 
numbers of T lymphocytes. The tissues tested for which there was no detectable expression included: human 
normal tissues: placenta, lung, spleen, adrenal gland, skin, kidney, eye, liver; 

human diseased tissue: liver disease: chronic hepatitis, chronic cholangitis, acute centri lobular necrosis 
10 (acetaminophen toxicity); Neoplasia (tumor muitiblock): osteosarcoma, squamous cell carcinoma; human fetal 
tissues: brairu spinal cord, lung, heart, kidney, axial and limb musculoskeleton vessels, umbilical cord; non- 
human primate: tongue, thyroid gland, parathyroid gland, stomach, salivary gland. 

IS98-I25. 

15 DNA4H74 (SEQ ID NO:248) has low level expression in non-human pnmatc thymus and in human 

tonsil in T lymphocyte specific regions. Hie limited distribution pattern suggests expression by T lymphocytes 
or cells closely associated with T lymphocytes such as antigen presenting cells (dendritic cell populations, ere). 
In inflamed tissue with significant lymphocytic inflammation and presence of reactive follicle formation 
(inflammatory bowel disease and chronic lymphocytic interstitial pneumonia/bronchitis) there was no 

20 detectable expression in areas which likely contain significant numbers of T lymphocytes. 

Inflamed lung: (chronic lymphocytic and granulomatous pneumonitis): weak to negative signal in the 
interstitium compared to the control sense probe. There was weak expression in normal chimp thymus (human 
thymus not available) and in human tonsil. In the latter the expression was predominantly in T lymphocyte 
areas of this structure including the perifollicular marginal zone and in the paracortex. 

25 There was no detectable expression in the following human tissues: inflammatory bowel disease (8 

patient specimens), chronically inflamed and normal lung (6 patient specimens), chronic sclerosing nephritis 
(1). chronically and acutely inflamed and cirrhotic liver (10 specimen muitiblock), normal and psoriatic skin, 
peripheral lymph node (non-reactive). 
The probes used for the above procedures were the following: 

30 4l374.pl (C337-G): (SEQ ID NO:306) 

5'-GGA TTC TAA TAC GAC TCA CTA TAG GGC CTC CAC AGA ACC TCG CCA TCA-3' 
41374.p2(C337-H): (SEQ ID NO.307) 

5'-CTA TG A AAT TAA CCC TCA CTA AAG GG A TGG GGC AAG ACT CAC AAG C AG-3' 

35 DNA53517 : 
IS98-070: 

DNA53517 (SEQ ID NO:255) expression in the normal adult murine lung was patchy, with 
expression in a subset of mucosal epithelial ceil in the large airway (bronchi/bronchioles). There is also 
expression within the rare discrete cells in the submucosal interstitium adjacent to the large airways. These 
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cells, typically 1-3 within a positive focus, are adjacent to large vessels and may represent smooth muscle cells, 
peripheral nerves or Schwann cells, or lymphatics. 

In the murine adult lung with allergic inflammation (eosinophilic, lymphocytic vasculitis,, 
bronchiolitis and pneumonitis), there was diffuse strong expression in all mucosal epithelial cells of all of the 
5 large airways (bronchi/bronchioles) of the lung. There was also strong expression in discrete cells that 
represent a subset of epithelial cells that line the alveoli; these ceils are type II pneumocytes. There is also 
expression, as in normal lung, present within rare discrete cells in the submucosal interstitium adjacent to the 
large airways. 

In normal adult murine small and large intestine, there is strong expression within multifocal few 
10 discrete single cells that are present in the submucosa, the tunica muscularis and the mesentery. The cells that 
express the signal are almost always associated with nerve, vein, artery triads within these areas. These ceils 
are spindle shaped and may be either a peripheral nerves. Schwann cells 

associated with such nerves or some type of support cell associated with vessel or lymphatics. Interestingly, 
there is no expression within identifiable myenteric piexi that are present within the tunica muscularis. 
15 In inflamed large bowel (from an IL10R KO mouse) the pattern of expression is similar but 

expression level is significantly decreased. 

IS98-US: 

DNA53715 (SEQ ID NO:255, mouse FIZZ-l) was used as a detection probe in die following human 
20 tissues: gastric carcinoma, inflamed lung (3 patients) (vessels, alveoli, large airways and mucous glands), 
aorta, heart, placenta and gall bladder. 

Expression of mouse DNA53715 (SEQ ID NO:255) was present in normal mouse lung in large airway 
epithelium and had marked increased expression in inflamed murine lung (airway epithelium, type 11 alveolar 
pneumocytes). It was also expressed in discrete cells in the submucosa of the large bowel along vascular 
25 channels. 

DNA8421Q : 

The following probes were used in the in situ studies indicated below: 
842 lO.p I (F-7961 9): (SEQ ID NO:3 1 0) 

30 5'-GGA TTC TAA TAC G AC TCA CTA TAG GGC GCG GTC GCA GG A CAT TCA GTA-3* 

84210.p2(F-79620): ■ (SEQ ID NO:311) 

y-CTATGA AAT TAA CCC TCA CTA AAG GGA ACT CTT TGG GTT CCA GCA CAC-3' 

DNA84210 (SEQ ID NO:285) is expressed in fetal kidney, primarily in developing glomeruli and 
tubules of the conical zone and also weakly in fetal lung and spinal cord. There is also expression in stromal 

35 cells adjacent to developing cartilage and bone. In adult tissues, weak expression is seen in normal bronchial 
epithelium, in one (adenocarcinoma) of five lung tumors (2 squamous and 3 adenocarcinomas) and in a 
chondrosarcoma. There is possibly expression in the skin and its appendages, however, the section is folded 
and difficult to evaluate. 

40 IS99-102: 
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Expression of DNA84210 (SEQ ID NO:285) in malignant melanoma, lung tumor, colon tumor/cell 
pellet, mouse tissues, fetal tissues. 

Expression of DNA84210 (SEQ ID NO:285) is seen in several adult (neoplastic and non-neoplastic) 
and fetal tissues. As far as normal adult tissues are concerned. DNA84210 (SEQ ID NO:285) is seen in the 
5 epidermis of skin (mostly in basaily located ceils) and in skin appendages, such as hair follicles and sebaceous 
glands associated with them. Expression is also seen in bronchial epithelium and submucosal bronchial glands. 
In human fetal tissues, expression of DNA84210 (SEQ ID NO:285) is seen in skin and skin appendages, lung, 
renal cortex and pancreatic ducts. It is also seen in mesenchymal cells adjacent to developing bone and 
cartilage. There is no hybridization signal seen in mouse embryos. Expression of DNA84210 (SEQ ID 
10 NO:285) is seen in one of six colorectal adenocarcinomas (weak), 2 of 3 lung adenocarcinomas (one shows 
strong, but very focal expression, one is very weakly positive), 0 of 3 lung squamous cell carcinomas and 1 of 
I chondrosarcomas (weak). Expression is also seen in 5 of 5 malignant melanomas, the intensity of expression 
ranges from very weak to strong. These sections also demonstrate expression of DNA84210 (SEQ ID NO:285) 
in normal epidermis and skin appendages. 
15 EXAMPLE 8 

Use of the PRO polypeptides as a hybridization probe 
The following method describes use of a nucleotide sequence encoding the PRO polypeptides as a 
hybridization probe. 

DNA comprising the coding sequence of full-length or mature PRO polypeptides is employed as a 
20 . probe to screen for homologous DNAs (such as those encoding naturally-occurring variants) in human tissue 

cDNA libraries or human tissue genomic libraries. 

Hybridization and washing of filters containing either library DNAs is performed under the following 

high stringency conditions. Hybridization of radiolabeled PRO - derived probe (e.£, PRO200. PRO204. 

PR02I2. PR0216. PR0226. PRO240. PR0235. PR0245. PRO 172. PR0273, PR0272. PR0332. PR0526. 
25 PRO701. PR0361. PR0362. PR0363. PR0364. PR0356. PR0531, PR0533. PRO1083. PR0865. PRO770. 

PR0769. PR07S8'. PROl 114. PRO1007. PROl 184. PRO1031. PR01346. PR01155. PRO1250. PR013I2. 

PROl 192. PROI246. PROI283. PROl 195. PR01343. PR014I8, PR01387, PRO1410, PR01917. PR01868. 

PRO205. PR021. PR0269. PR0344. PR0333. PR0381. PRO720. PR0866, PRO840. PR0982. PR0836. 

PROl 159. PR01358. PR01325. PR01338. PR01434. PR04333. PRO4302, PRO4430 or PR05727) to the 
30 filters is performed in a solution of 50% formamide. 5x SSC, 0. 1% SDS. 0. 1% sodium pyrophosphate. 50 mM 

sodium phosphate. pH 6.8. 2x Denhardt's solution, and 10% dextran sulfate at 42°C for 20 hours. Washing of 

the filters is performed in an aqueous solution of 0. lx SSC and 0. 1% SDS at 42°C. 

DNAs having a desired sequence identity with the DNA encoding full-length native sequence PRO 

polypeptide can then be identified using standard techniques known in the art. 

35 



EXAMPLE 9 
Expression of the PRO polypeptide in E, coli 
40 This example illustrates preparation of an unglycosylated form of the PRO polypeptides by 
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recombinant expression in E. coli. 

The DNA sequence encoding the PRO polypeptide is initially amplified using selected PCR primers. 
The primers should contain restriction enzyme sites which correspond to the restriction enzyme sites on the 
selected expression vector. A variety of expression vectors may be employed. An example of a suitable vector 
5 is pBR322 (derived from E. coli: see Bolivar et aL Gene. 2:95 (1977)) which contains genes for ampicillin and 
tetracycline resistance. The vector is digested with restriction enzyme and dephosphorylatcd. . The PCR 
amplified sequences are then ligated into the vector. The vector will preferably include sequences which 
encode for an antibiotic resistance gene, a trp promoter, a polyhis leader (including the first six STII codons, 
polyhis sequence, and enterokinase cleavage site), the PRO polypeptide coding region, lambda transcriptional 
10 terminator, and an argU gene. 

The ligation mixture is then used to transform a selected E. coli strain using the methods described in 
Sambrook et aL, supra. Transformants are identified by their ability to grow on LB plates and antibiotic 
resistant colonies are then selected. Plasmid DNA can be isolated and confirmed by restriction analysis and 
DNA sequencing. 

*5 Selected clones can be grown overnight in liquid culture medium such as LB broth supplemented with 

antibiotics. The overnight culture may subsequently be used to inoculate a larger scale culture. The cells arc 
then grown to a desired optical density, during which the expression promoter is turned on. 

After culruring the ceils for several more hours, the cells can be harvested by ccntrifugation. The cell 
pellet obtained by the centrifugation can be solubiiized using various agents known in the art. and the 

20 solubiiized PRO polypeptide protein can then be purified using a metal chelating column under conditions that 
allow tight binding of the protein. 

The PRO polypeptides may also be expressed in E. coli in a poly-His tagged form, using the following 
procedure. The DNA encoding a PRO polypeptide is initially amplified using selected PCR pnmers. The 
primers contain restriction enzyme sites which correspond to the restriction enzyme sites on the selected 

25 expression vector, and other uscrul sequences providing for efficient and reliable translation initiation, rapid 
purification on a metal chelation column, and proteolytic removal with enterokinase. The PCR-amplificd. 
poly-His tagged sequences are then ligated into an expression vector, which is used to transform an E. coli host 
based on strain 52 (W31 10 tuhA(tonA) Ion galE rpoHts(htpRts) clpP(lacIq). Transformants are first grown in 
LB containing 50 mg/'ml carbeniciilin at 30*C with shaking until an O.D.600 of 3-5 is reached. Cultures are 

30 then diluted 50-100 fold into CRAP media (prepared by mixing 3.57 g (NH 4 ) 2 S04, 071 g sodium 
citrate- 2H20, 1.07 g KCl. 5.36 g Difco yeast extract, 5.36 g Sheffield hycasc SF in 500 mL water, as well as 
1 10 mM MPOS, pH 7.3. 0.55% (w/v) glucose and 7 mM MgS<2>4) and grown for approximately 20-30 hours at 
30*C with shaking. Samples are removed to verify expression by SDS-PAGE analysis, and the bulk culture is 
centrifuged to pellet the ceils. Cell pellets are frozen until purification and refolding. 

35 £. coli paste from 0.5 to 1 L fermentations (6-10 g pellets) is resuspended in 10 volumes (w/v) in 7 M 

guanidine, 20 mM Tris, pH 8 buffer. Solid sodium sulfite and sodium tetrathionate is added to make final 
concentrations of 0.1 M and 0.02 M, respectively, and the solution is stirred overnight at 4'C. This step results 
in a denatured protein with all cysteine residues blocked by sulfitolization. The solution is centrifuged at 
40,000 rpm in a Beckman Ultracentifuge for 30 min. The supernatant is diluted with 3-5 volumes of metal 

40 chelate column buffer (6 M guanidine. 20 mM Tris. pH 7.4) and filtered through 0.22 micron filters to clarify. 
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Depending on condition the clarified extract is loaded onto a 5 ml Qiagen Ni-NTA metal chelate column 
equilibrated in the metal chelate column buffer. The column is washed with additional buffer containing 50 
mM imidazole (Calbiochem, Utrol grade), pH 7.4. The protein is eluted with buffer containing 250 mM 
imidazole. Fractions containing the desired protein was pooled and stored at 4"C. Protein concentration is 
5 estimated by its absorbance at 280 nm using the calculated extinction coefficient based on its amino acid 
sequence. 

The proteins are refolded by diluting sample slowly into freshly prepared refolding buffer consisting of: 
20 mM Tris. pH 8.6. 0.3 M NaCl 2.5 M urea. 5 mM cysteine. 20 mM glycine and 1 mM EDTA. Refolding 
volumes are chosen so that the final protein concentration is between 50 to 100 micrograms/ml. The refolding 

10 solution is stirred gently at 4°C for 12-36 hours. The refolding reaction is quenched by the addition of TFA to 
a final concentration of 0.4% (pH of approximately 3). Before further purification of the protein, the solution 
is filtered through a 0.22 micron filter and acetonitrile is added to 2-10% final concentration. The refolded 
protein is chromatographed on a Poros RI/H reversed phase column using a 'mobile buffer of 0.1% TFA with 
elution with a gradient of acetonitrile from 10 to 80%. Aliquots of fractions with A280 absorbance are 

15 analyzed on SDS poiyacrylamide ceis and fractions containing homogeneous refolded protein arc pooled. 
Generally, the properly refolded species of most proteins are eluted at the lowest concentrations of acetonitrile 
since those species are the most compact with their hydrophobic interiors shielded from interaction with the 
reversed phase resin. Aggregated species are usually eluted at higher acetonitrile concentrations. In addition 
to resolving misfoldcd forms of proteins from the desired form, the reversed phase step also removes endotoxin 

20 from the samples. 

Fractions containing the desired folded PRO polypeptide proteins are pooled and the acetonitrile 
removed using a gentle stream of nitrogen directed at the solution. Proteins are formulated into 20 mM Hepes, 
pH 6.8 with 0.14 M sodium chloride and 4% mannitol by dialysis or by gel filtration using G25 Superfine 
(Pharmacia) resins equilibrated in the formulation buffer and sterile filtered. 

25 

EXAMPLE 10 
Expression of the PRO polypeptides in mammalian cells 
This example illustrates preparation of a potentially glycosylated form of the PRO polypeptide in 
recombinant expression in, mammalian cells. 
30 The vector. pRK5 (see EP 307,247, published March 15. 1989), is employed as the expression vector. 

Optionally, the PRO DNA is ligated into pRK5 with selected restriction enzymes to allow insertion of the 
respective PRO DNA usmg ligation methods such as described in Sambrook et ai t supra . The resulting vector 
is called, for example, pRK5-PRO. 

In one embodiment, the selected host cells may be 293 cells. Human 293 cells (ATCC CCL 1573) are 
35 grown to confluence in tissue culture plates in medium such as DMEM supplemented with fetal calf serum and 
optionally, nutrient components and/or antibiotics. About 10 ug of the pRK5-PRO DNA is mixed with about 1 
DNA encoding the VARNA gene [Thimmappaya et aL, Cell, 31:543 (1982)] and dissolved in 500 uL of 1 
mM Tris-HCl, 0.1 mM EDTA, 0.227 M CaCk To this mixture is added, dropwise, 500 uL of 50 mM HEPES 
(pH 7.35), 280 mM NaCl, 1.5 mM NaP0 4 , and a precipitate is allowed to form for 10 minutes at 25°C. The 
40 precipitate is suspended and added to the 293 cells and allowed to settle for about four hours at 37°C. The 
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culture medium is aspirated off and 2 ml of 20% glycerol in PBS is added for 30 seconds. The 293 cells are 
then washed with serum free medium, fresh medium is added and the cells are incubated for about 5 days. 

Approximately 24 hours after the transfections, the culture medium is removed and replaced with 
culture medium (alone) or culture medium containing 200 uCi/ml ,5 S-cysieine and 200 uCi/mi M S- methionine. 
5 After a 12 hour incubation, the conditioned medium is collected, concentrated on a spin filter, and loaded onto 
a 15% SDS gel. The processed gel may be dried and exposed to film for a selected period of time to reveal the 
presence of the polypeptide of the invention polypeptide. The cultures containing transfected cells may 
undergo further incubation (in serum free medium) and the medium is tested in selected btoassays. 

In an alternative technique, pRK5-PRO may be introduced into 293 cells transiently using the dextran 

10 sulfate method described by Somparyrac et aL. Proc. NatL Acad. ScL. 12:7575 (1981). 293 cells are grown to 
maximal density in a spinner flask and 700 ug pRK5-PRO is added. The cells are first concentrated from the 
spinner flask by centrifugation and washed with PBS. The DNA-dextran precipitate is incubated on the cell 
pellet for four hours. The cells are treated with 20% glycerol for 90 seconds, washed with tissue culture 
medium, and re-introduced into the spinner flask containing tissue culture medium. 5 ug/ml bovine insulin and 

15 0.1 ug/mi bovine transferrin. After about tour days, the conditioned media is centrifuced and filtered to 
remove cells and debris. The sample containing the expressed polypeptide of the invention can then be 
concentrated and purified by any selected method, such as dialysis and/or column chromatography. 

In another embodiment, the polypeptides of the invention can be expressed in CHO cells. The pRJC5- 
PRO can be transfected into CHO cells using known reagents such as CaP0 4 or DEAE-dextran. As described 

20 above, the cell cultures can be incubated, and the medium replaced with culture medium (alone) or medium 
containing a radiolabel such as M S-methionine. After determining the presence of a polypeptide of the 
invention polypeptide, the culture medium may be replaced with serum free medium. Preferably, the cultures 
are incubated for about 6 days, and then the conditioned medium is harvested. The medium containing the 
expressed polypeptide of the invention can then be concentrated and purified by any selected method. 

25 Epitope-tacgcd polypeptide of the invention may also be expressed in host CHO cells. The DNA 

encoding the desired polypeptide of the invention may be subcioned out of the pRK5 vector. The subclone 
insert can undergo PCR to fuse in frame with a selected epitope tag such as a poly-his tag into a Daculovirus 
expression vector. The poly-his tagged polypeptide of the invention insert can then be subcioned into a SV40 
driven vector containing a selection marker such as DHFR for selection of stable clones. Finally, the CHO 

30 cells can be transfected (as described above) with the SV40 driven vector. Labeling may be performed, as 
described above, to verify expression. The culture medium containing the expressed poIy-His tagged 
polypeptide of the invention can then be concentrated and purified by any selected method, such as by Ni 2 *- 
cheiate affinity chromatography. 
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EXAMPLE 1 1 
Expression of PRO in Yeast 
The following method describes recombinant expression of PRO in yeast. 

First, yeast expression vectors are constructed for intracellular production or secretion of the PRO 
5 polypeptide from the ADH2/GAPDH promoter. DNA encoding a polypeptide of the invention and the 
promoter is inserted into suitable restriction enzyme sites in the selected plasmid to direct intracellular 
expression of the PRO. For secretion, DNA encoding the PRO can be cloned into the selected plasmid, 
together with DNA encoding the ADH2/GAPDH promoter, a native sequence PRO signal peptide or other 
mammalian signal peptide, or. for example, a yeast alpha- factor or invenase secretory signal/leader sequence. 

10 and linker sequences (if needed) for expression of the polypeptide of the invention. 

Yeast cells, such as yeast strain AB110. can then be transformed with the expression piasmids 
described above and cultured in selected fermentation media. The transformed yeast supernatants can be 
analyzed by precipitation with 10% trichloroacetic acid and separation by SDS-PAGE, followed by staining of 
the gels with Coomassie Blue stain. 

15 Recombinant PRO can subsequently be isolated and purified by removing the yeast ceils from the 

fermentation medium by ccntnfugation and then concentrating the medium using selected cartridge filters. The 
concentrate containing the polypeptide of the invention may further be purified using selected column 
chromatography resins. 

20 EXAMPLE 12 

Expression of PRO in Baculovirus-Infected Insect Cells 
The following method describes recombinant expression of PRO in Baculovirus-infccted insect cells. 
The sequence coding for PRO is tiised upstream of an epitope tag contained within a baculovirus 
expression vector. Such epitope tags include poly-his tags and immunoglobulin tags (like Fc regions of IgG). 
25 A variety of piasmids may be employed, including piasmids derived from commercially available piasmids 
such as pVL!393 (Novagen). Briefly, the sequence encoding a polypeptide of the invention or the desired 
portion of the coding sequence of the DNA encoding a PRO polypeptide [such as the sequence encoding the 
extracellular domain of a transmembrane protein or the sequence encoding the mature protein if the protein is 
extracellular] is amplified by PCR with primers complementary to the 5* and 3' regions. The 5' primer may 
30 incorporate flanking (selected) restriction enzyme sites. The product is then digested with those selected 
restriction enzymes and subcloned into the expression vector. 

Recombinant baculovirus is generated by co-transfecting the above plasmid and BaculoGoId™ virus 
DNA (Pharmingen) into Spodoptera Jrugiperda ("Sf9") cells (ATCC CRL 1711) using lipofectin 
(commercially available from GIBCO-BRL). After 4 - 5 days of incubation at 28°C, the released viruses are 
35 harvested and used for further amplifications. Viral infection and protein expression are performed as 
described by O'Reilley et aL Baculovirus expression vectors: A Laboratory Manual. Oxford: Oxford 
University Press ( 1 994). 

Expressed poly-his tagged polypeptide of the invention can then be purified, for example, by Nt 2+ - 
chelate affinity chromatography as follows. Extracts are prepared from recombinant virus-infected Sf9 cells as 
40 described by Rupert et aL Nature, 362:175-179 (1993). Briefly, Sf9 cells are washed, resuspended in 

' . ■ 157 

SUBSTITUTE SHEET (RULE 26) 



WO 00/53758 



PCTAJSOO/05841 



sonication buffer (25 mL Hepes. pH 7.9: 12.5 mM MgCh; 0.1 mM EDTA; 10% glycerol; 0.1% NP-40; 0.4 M 
KG), and sonicated twice for 20 seconds on ice. The sonicates are cleared by centrifugauon. and the 
supernatant is diluted 50-fold in loading buffer (50 mM phosphate, 300 mM NaCl. 10% glycerol. pH 7.8) and 
filtered through a 0.45 um filter. A Ni 2 *-NTA agarose column (commercially available from Qiagen) is 

5 prepared with a bed volume of 5 mL, washed with 25 mL of water and equilibrated with 25 mL of loading 
buffer. The filtered ceil extract is loaded onto the column at 0.5 mL per minute. The column is washed to 
baseline A 2 so with loading buffer, at which point fraction collection is started. Next, the column is washed with 
a secondary wash buffer (50 mM phosphate: 300 mM NaCl. 10% glycerol. pH 6.0). which elutes 
nonspecifically bound protein. After reaching A 2 ao baseline again, the column is developed with a 0 to 500 

10 mM Imidazole gradient in the secondary wash buffer. One mL fractions are collected and analyzed by SDS- 
PAGE and silver staining or Western blot with Ni 2 "-NTA-conjugated to alkaline phosphatase (Qiagen). 
Fractions containing the eluted His !0 -tagged- polypeptide of the invention are pooled and dialyzed against 
loading buffer. 

Alternatively, purification of the IgG tagged (or Fc tagged) PRO polypeptide can be performed using 
1 5 known chromatography techniques, including for instance. Protein A or protein G column chromatography. 

EXAMPLE 13 
Preparation of Antibodies that Bind PRO 
This example illustrates preparation of monoclonal antibodies which can specifically bind the 
20 polypeptides of the invention. 

Techniques for producing the monoclonal antibodies are known in the art and are described, for 
instance, in Goding, supra. Irnmunogens that may be employed include the purified polypeptide of the 
invention itself, fusion proteins containing the respective polypeptide of the invention, and cells 
expressing recombinant polypeptide of the invention on the cell surface. Selection of the immunogen can be 
25 made by the skilled artisan without undue experimentation. 

Mice, such as Balb/c. are immunized with the polypeptide of die invention immunogen emulsified in 
complete Freund's adjuvant and injected subcutaneously or intraperitoneal ly in an amount from I- 100 
micrograms. Alternatively, the immunogen is emulsified in MPL-TDM adjuvant (Ribi Immunochemical 
Research. Hamilton. MT) and injected into the animal's hind foot pads. The immunized mice are then boosted 
30 10 to 12 days later with additional immunogen emulsified in the selected adjuvant. Thereafter, for several 
weeks, the mice may also be boosted with additional immunization injections. Serum samples may be 
periodically obtained from the mice by retro-orbital bleeding for testing in ELISA assays to detect antibodies 
specific to the respective polypeptide of the invention. 

After a suitable antibody titer has been detected, the animals "positive" for antibodies can be injected 
35 with a final intravenous injection of the respective polypeptide of the invention. Three to four days later, the 
mice are sacrificed and the spleen cells are harvested. The spleen cells are then fused (using 35% polyethylene 
glycol) to a selected murine myeloma cell line such as P3X63AgU.l, available from ATCC, No. CRL 1597. 
The fusions generate hybridorna cells which can then be plated in 96 well tissue culture plates containing HAT 
! (hypoxan thine, aminopterm. and thymidine) medium to inhibit proliferation of non- fused cells, myeloma 

40 hybrids, and spleen cell hybrids. 
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The hybridoma cells are screened in an ELISA for reactivity against the respective polypeptide of the 
invention. Determination of "positive" hybridoma ceils secreting the desired monoclonal antibodies against the 
polypeptides of the invention is within the skill in the art 

The positive hybridoma cells can be injected intraperitonealiy into syngeneic Balb/c mice to produce 

5 ascites containing the anti-PRO200, anti-PRO204, anti-PR0212, anti-PR0216, antx-PR0226, anti-PRO240, 
anti-PR0235. anti-PR0245, anti-PROl 72, anti-PR0273. anti-PR0272. anti-PR0332, anti-PR0526, anti- 
PRO701, anti-PR036h anti-PR0362. ami-PR0363. anti-PR0364, anti-PR0356, anti-PR0531, anti-PR0533, 
anri-PROt083. anti-PR0865, anti-PRO770, anti-PR0769; ami-PR0788. anti-PROtl 14. anti-PRO1007, anti- 
PROl 184, anti-PRO103i, anti-PRO 1 346, anti-PROl 155, anti-PROl250. ami-PR013l2, anti-PROl 192, anti- 

10 PR01246, anti-PROI283, anti-PROl 195, anti-PROI343, anti-PR014l8, anti-PR01387. anti-PROHIO, anti- 
PR01917, anti-PR01868. anti-PRO205, anti-PR02I. anti-PR0269, anti-PR0344, anti-PR0333, anti-PR0381, 
anti-PRO720, anti-PR0866, anti-PRO840. anti-PR0982, anti-PR0836, anti-PROl 159. anti-PROI358, anti- 
PR01325, anti-PR01338, anti-PR01434, anti-PR04333, anti-PRO4302, anti-PRO4430 or anti-PR05727 
monoclonal antibodies. Alternatively, the hybridoma cells can be grown, in tissue culture flasks or roller 

15 bottles. Purification of the monoclonai antibodies produced in the ascites can be accomplished using 
ammonium sulfate precipitation, followed by gel exclusion chromatography. Alternatively, affinity 
chromatography based upon binding of antibody to protein A or protein G can be employed. 

Deposit of Material 

20 The following materials have been deposited with the American Type Culture Collection, 10801 

University Blvd.. Manassas. VA 201 10-2209, USA (ATCC): 





Material 


UNO 


PRO 


ATCC# 


ATCC Deposit Date 




DNA29I0I-1276 


174 


200 


209653 


March 5, 1998 


25 


DNA30871-M57 


178 


204 


209380 


October 16, 1997 




DNA30942-1I34 


186 


-212 


209254 


September 16. 1997 




DNA33087-1158 


190 


216 


209381 


October 16, 1997 




DNA33460-1166 


200 


226 


209376 


October 16, 1997 




DNA34387-U33 


214 


240 


209260 


September 16, 1997 


30 


DNA35558-I167 


209 


235 


209374 


October 16, 1997 




DNA35638-1141 


219 


245 


209265 


September 16 t 1997 




DNA35916-1161 


146 


172 


209419 


October 28, 1997 




DNA39523-1192 


240 


273 


209424 


October 31, 1997 




DNA40620-1183 


239 


272 


209388 


October 17, 1997 


35 


DNA40982-1235 


293 


332 


209433 


November 17, 1997 




DNA44184-1319 


330 


526 


209704 


March 26, 1998 




DNA44205-1285 


365 


701 


209720 


March 31, 1998 




DNA45410-1250 


316 


361 


209621 


February 5, 1998 




DNA45416-1251 


317 


362 


209620 


February 5, 1998 


40 


DNA45419-1252 


318 


363 


209616 


February 5, 1998 
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DNA47j65-120o 


319 


364 


209436 


November 7, 1997 




DNA47470-1 130 


313 


356 


209422 


October 28, 1997 




DNA483 14-1320 


332 


531 


209702 


March 26, 1998 




DNA49435-1219 


334 


533 


209480 


November 21 , 1997 


5 


DNA5092 1-1458 


540 


1083 


209859 


May 12, 1998 




DNA53 974- 1401 


434 


865 


209774 


April 14, 1998 




DNA54228-1366 


408 


770 


209801 


April 23. 1998 




DNA54231-1366 


407 


769 


209802 


April 23. 1998 




DNA56405-1357 


430 


788 


209849 


May 6, 1998 


10 


DNA57033-1403 


557 


1114 


209905 


May 27, 1998 




DNA57690-1374 


491 


1007 


209950 


June 9, 1998 




DNA59220-1514 


598 


1184 


209962 


June 9, 1998 




DNA59294-I381 


516 


1031 


209866 


May 14, 1998 




DNA59776-1600 


701 


1346 


203128 


August 18, 1998 


15 


DNA59849-1504 


585 


1155 


209986 


June 16. 1998 




DNA60775-I532 


633 


1250 


203173 


September 1. 1998 




DNA61873-1574 


678 


1312 


203132 


August 18. 1998 




DNA628 14-1521 


606 


1192 


203093 


August 4. 1998 




DNA64885-1529 


630 


1246 


203457 


November 3, 1998 


20 


DNA65404-I551 


653 


1283 


203244 


September 9.. 1998 




DNA65412-I523 


608 


1195 


203094 


August 4, 1998 




DNA66675-1587 


698 


1343 


203282 


September 22, 1998 




DNA68864-1629 


732 


1418 


203276 


September 22. 1998 




DNA68872-I620 


722 


1387 


203160 


August 25. 1998 


25 


DNA68874-I622 


728 


1410 


203277 


September 22, 1998 




DNA76400-2523 


900 


1917 


203573 


January 12. 1999 




DNA77624-2515 


859 


1368 


203553 


December 22. 1998 




DNA30868-1I56 


179 


205 





March 2, 2000 




DNA36638-1056 


21 


21 


209456 


November 12, 1997 


30 


DNA38260-1180 


236 


269 


209397 


October 17, 1997 




DNA40592-1242 


303 


344 


209492 


November 21, 1997 




DNA41374-1312 


294 


333 








DNA44194-I317 


322 


381 


209808 


April 28. 1998 




DNA53517-1366 


388 


720 


209802 


April 23. 1998 


35 


DNA53971-1359 


435 


866 


209750 


April 7, 1998 




DNA53987-1438 


433 


840 


209858 


May 12, 1998 




DNA57700-1408 


483 


982 


203583 


January 12, 1999 




DNA59620-1463 


545 


836 


209989 


June 16, 1998 




DNA60627-1508 


589 


1159 


203092 


August 4, 1998 


40 


DNA64890-I612 


707 


1358 


203131 


August 18, 1998 
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DNArtfi/iSQ-l SOI 

L/|iAUUUJ7' 1 J7J 


Oo J 


1 "OS 




oeptemoer zi, lWo 






1 ^TS 
i JJO 




c«a> AM u*» to toon 

September /x, iwa 


DNA688 1 8-2536 


739 


1434 ~ 


203657 


February 9, 1999 


DNA842 10-2576 


1888 


4333 


203818 


March 2, 1999 


DNA922 18-2554 


1866 


4302 


203834 


March 9, 1999 


DNA96878-2626 


1947 


4430 


23-PTA 


May 5, 1999 


DNA98853-1739 


2448 


5727 


203906 


April 6, 1999 



These deposits was made under the provisions of the Budapest Treaty on the International 

10 Recognition of the Deposit of Microorganisms for the Purpose of Patent Procedure and the Regulations 
thereunder (Budapest Treaty). This assures maintenance of a viable culture of the deposit for 30 years from the 
date of deposit. The deposit will be made available by ATCC under the terms of the Budapest Treaty, and 
subject to an agreement between Genentech, Inc. and ATCC, which assures permanent and unrestricted 
availability of the progeny of the culture of the deposit to the public upon issuance of the pertinent U.S. patent 

1 5 or upon laying open to the public of any U.S. or foreign patent application, whichever comes first, and assures 
availability of ihc progeny to one determined by the U.S. Commissioner of Patents and Trademarks to be 
entitled thereto according to 35 USC 122 and the Commissioner's rules pursuant thereto (including 37 CFR 
1.14 with particular reference to 886 OG 63S). 

The assignee of the present application has agreed that if a culture of the materials on deposit should 

20 die or be lost or destroyed when cultivated under suitable conditions, die materials will be promptly replaced 
on notification with another of the same. Availability of the deposited material is not to be construed as a 
license to practice the invention in contravention of the rights granted under the authority of any government in 
accordance with its patent laws. 

The foregoing written specification is considered to be sufficient to enable one skilled in the art to 

25 practice the invention. The present invention is not to be limited in scope by the construct deposited, since the 
deposited embodiment is intended as a single illustration of certain aspects of the invention and any constructs 
that are functionally equivalent are within the scope of this invention. "The deposit of material herein docs not 
constitute an admission that the written description herein contained is inadequate to enable the practice of any 
aspect of the invention, including the best mode thereof, nor is it to be construed as limiting the scope of the 

30 claims to the specific illustrations that it represents. Indeed, various modifications of the invention in addition 
to those shown and described herein will become apparent to those skilled in the an from the foregoing 
description and fall within the scope of the appended claims. 
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What is claimed; 

1. A composition useful for the treatment of immune related diseases, comprising a PRO200, 
PRO204. PR0212, PR0216, PR0226. PRO240. PR0235, PR0245, PR0172, PR0273, PR0272, PR0332, 
PR0526, PRO701, PR0361, PR0362, PR0363, PR0364. PR0356, PR053I, PR0533, PRO1083. PR0865, 
PRO770, PR0769, PR0788, PROl 114, PRO1007, PROl 184, PRO103I, PR01346, PROl 155, PRO1250. 
PR01312, PROl 192, PR01246, PR01283. PROl 195. PR01343, PR01418. PR01387, PROI410, PR01917. 
PRO 1868. PRO205. PR021. PR0269, PR0344, PR0333, PR0381, PRO720, PR0866, PRO840, PR0982. 
PR0836, PROl 159, PR01358, PR01325, PR01338. PR01434, PR04333, PRO4302, PRO4430 or PR05727 
polypeptide, agonist or fragment thereof and a carrier or excipient. having the properties of: 

(a) increasing infiltration of inflammatory cells into a tissue of a mammal in need thereof, 

(b) stimulating or enhancing an immune response in a mammal in need thereof, or 

(c) increasing the proliferation of T-lymphocytes in a mammal in need thereof in response to an 
antigen. 

2. The composition of claim 1 comprising an effective amount of a PRO200. PRO204. 
PR0212. PR0216. PR0226. PRO240. PR0235. PR0245, PR0I72. PR0273. PR0272. PR0332. PR0526. 
PRO701. PR0361. PR0362. PR0363. PR0364. PR0356, PR0531. PR0533. PRO1083. PR0865. PRO770. 
PR0769. PR0788. PROl 1 14, PROI007, PR01134. PRO1031. PR01346, PR01155, PRO1250, PR01312. 
PROl 192. PR01246. PROI283. PROl 195, PR01343. PR014I8, PR01387, PRO1410. PR01917, PR01868, 
PRO205. PR02I, PR0269. PR0344. PR0333. PR0381, PRO720, PR0866, PRO840, PR0982. PR0836. 
PROl 159. PR0I358, PR01325, PROI338. PR01434, PR04333, PRO4302, PRO4430 or PR05727 
polypeptide, agonist, antagonist or fragment thereof. 

3. The composition of claim 2 further comprising a growth inhibitory agent, cytotoxic agent or 
chemo therapeutic agent. 

4. Use of a PRO200. PRO204. PR0212. PR0216. PR0226. PRO240. PR0235. PR0245. 
PRO 172. PR0273, PR0272. PR0332. PR0526. PRO701, PR0361, PR0362, PR0363, PR0364. PR0356. 
PR0531. PR0533, PRO1083, PR0865, PRO770, PR0769, PR0788. PROl 1 14, PRO1007. PRO 11 84. 
PRO1031. PR01346, PROl 155, PRO1250. PR01312, PROl 192, PR01246, PR01283, PROl 195. PR01343, 
PR01418, PR01387, PRO1410, PR01917, PR01868. PRO205, PR021, PR0269, PR0344, PR0333, 
PR0381, PRO720, PR0866. PRO840. PR0982, PR0836, PROl 159, PR01358, PR01325, PR01338, 
PRO 1434, PR04333, PRO4302. PRO4430 or PR05727 polypeptide, agonist or a fragment thereof to prepare 
a composition having the properties of: 

(a) increasing infiltration of inflammatory ceils into a tissue of a mammal in need thereof, 

(b) stimulating or enhancing an immune response in a mammal in need thereof, or 

(c) increasing the proliferation of T-lymphocytes in a mammal in need thereof in response to an 
antigen. 

5. The use of claims 4 comprising an effective amount of a PRO200, PRO204, PR0212, 
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PR0216, PR0226. PRO240, PR0235, PR0245, PROI72, PR0273, PR0272, PR0332. PR0526, PRO701, 
PR0361, PR0362, PR0363, PR0364, PR0356, PR0531, PR0533, PRO1083, PR0865, PRO770, PR0769, 
PR0788, PR01I14, PRO1007, PR01184, PRO103I, PR01346, PROU55, PRO1250, PR01312, PROU92, 
PR01246, PROI283, PR01195, PR01343, PR01418, PR01387, PRO1410, PR01917, PR01868, PRO205, 
5 PR021, PR0269, PR0344, PR0333, PR0381, PRO720, PR0866, PRO840, PR0982, PR0836, PROU59, 
PR01358. PR01325, PR01333, PR01434, PR04333, PRO4302, PRO4430, PR05727 polypeptide, agonist, 
antagonist or fragment thereof. 

6. The composition of claim 2 further comprising a growth inhibitory agent, cytotoxic agent or 
10 chemotherapeutic agent. 

7. A method of treating an immune related disorder, such as a T cell mediated disorder, in a 
mammal in need thereof, comprising administering to the mammal an effective amount of a PRO200. PRO204, 
PR02I2. PR0216, PR0226. PRO240, PR0235, PR0245, PR0172, PR0273, PR0272, PR0332, PR0526. 

15 PRO70I. PR0361. PR0362! PR0363, PR0364. PR0356. PR053L PR0533. PRO1083. PR0865, PRO770. 

PR0769. PR0788, PR01114, PRO1007. PR01184. PRO1031, PR01346, PROI155. PROI250. PR01312. 

PR01192. PROI246, PR01283, PROU95. PR01343, PR01418, PR01387. PRO1410, PR01917. PR01868. 

PRO205. PR021, PR0269. PR0344, PR0333, PR0381, PRO720, PR0866, PRO840. PR0982. PR0836, 

PR01159. PR01358. PR01325, PR01338, PR01434, PR04333, PRO4302, PRO4430 or PR05727 
20 polypeptide, an agonist antibody thereof, an antagonist antibody thereto, or a fragment thereof. 

8. The method of claim 7. wherein the disorder is selected from systemic lupus erythematosis, 
rheumatoid anhritis, osteoarthritis, juvenile chronic arthritis, spondyloarthropathies, systemic sclerosis, 
idiopathic inflammatory myopathies. Sjogren's syndrome, systemic vasculitis, sarcoidosis, autoimmune 

25 hemolytic anemia, autoimmune thrombocytopenia, thyroiditis, diabetes mellitus. immune-mediated renal 
disease, demyelinating diseases of the central and peripheral nervous systems such as multiple sclerosis, 
idiopathic demyelinating polyneuropathy or Guillain-Barre syndrome, and chronic inflammatory 
demyelinating polyneuropathy, hepatobiliary diseases such as infectious, autoimmune chronic active hepatitis, 
primary biliary cirrhosis, granulomatous hepatitis, and sclerosing cholangitis, inflammatory bowel disease. 

30 gluten-sensitive enteropathy, and Whipple's disease, autoimmune or immune-mediated skin diseases including 
bullous skin diseases, erythema multiforme and contact dermatitis, psoriasis, allergic diseases such as asthma, 
allergic rhinitis, atopic dermatitis, food hypersensitivity and urticaria, immunologic diseases of the lung such as 
eosinophilic pneumonias, idiopathic pulmonary fibrosis and hypersensitivity pneumonitis, transplantation 
associated diseases including graft rejection and graft -versus-host-disease. 

35 

9. The composition or use of any of the preceding claims, wherein the agonist or antagonist is a 
monoclonal antibody. 

10. The composition or use of any of the preceding claims, wherein the agonist or antagonist is 
40 an antibody fragment or a single-chain antibody. 
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11. The composition or use of claims 9 or 10, wherein the antibody has nonhuman 
complementarity determining region (CDR) residues and human framework region (FR) residues. 

5 12. A method for determining the presence of a PRO200, PRO204, PR0212, PR0216, PR0226, 

PRO240. PR0235, PR0245, PR0172, PR0273, PR0272, PR0332, PR0526. PRO701. PR0361, PR0362, 
PR0363. PR0364, PR0356, PR053L PR0533, PRO1083 ? PR0865, PRO770, PR0769, PR0788, PROU14, 
PRO1007. PRO 1 184, PRO1031, PR01346, PROI155. PRO1250, PR01312. PROl 192. PR01246, PR01283, 
PR01195, PR01343, PR01418, PR01387, PRO1410, PR01917, PR01868. PRO205. PR021, PR0269, 

10 PR0344. PR0333, PR038I, PRO720, PR0866, PRO840, PR0982, PR0836, PROU59, PR01358, PR01325, 
PR01338, PR01434, PR04333, PRO4302, PRO4430, PR05727 polypeptide, comprising exposing a ceil 
suspected of containing the polypeptide to an ami- PRO200, anti-PRO204. anti-PR0212, anti-PR0216, anti- 
PR0226. anti-PRO240, anti-PR0235, anti-PR0245, anti-PR0172, anti-PR0273, anti-PR0272, anti-PR0332, 
anti-PR0526. anti-PRO701. anti-PR0361, anti-PR0362. anti-PR0363, anti-PR0364. anti-PR0356, anti- 

15 PR0531. anti-PR0533. anti-PRO1083. anu-PR0865. anu-PRO770. anti-PR0769, anu-PR0788, anti- 
PROl 1 14. anti-PRO1007. anti-PROl 184, anti-PROI031. anti-PR0l346. ami-PROI 155. anti-PROl250. and- 
PROI3I2. anti-PROU92. anii-PR01246. anti-PROl283. anti-PROl 195, anti-PR01343. ami-PRO!4l8. anti- 
PROI387. anti-PROl410, anti-PR01917, anu-PROl868, anti-PRO205, anti-PR02l, anri-PR0269, ami- 
PR0344. anti-PR0333, anti-PR0381, anti-PRO720, ami-PR0866. ami-PRO840, anti-PR0982. anti-PR0836, 

20 anu-PR0M59. ami-PR01358. ami-PRO!325, anti-PR0!338. antx-PRO!434, anti-PR04333, anti-PRO4302, 
anu-PRO4430. anti-PR05727 antibody, respectively, and determining binding of the antibody to the cell. 

13. A method of diagnosing an immune related disease in a mammal, comprising detecting the 
level of expression of a gene encoding a PRO200, PRO204, PR02I2, PR0216, PR0226. PRO240. PR0235, 

25 PR0245. PR0172. PR0273, PR0272. PR0332. PR0526. PRO70I. PR0361. PR0362. PR0363. PR0364. 
PR0356. PR053I, PR0533. PRO1083. PR0865. PRO770. PR0769. PR0788. PROl 1 14. PRO1007. 
PROl 184. PRO1031, PR01346. PROl 155. PRO1250. PR01312. PR01192. PR01246. PR01283. PROl 195, 
PR01343. PROI418. PR01387, PRO14I0, PRO 19 17. PRO 1 868, PRO205, PR021, PR0269, PR0344, 
PR0333. PR0381, PRO720. PR0866. PRO840. PR0982, PR0836. PROl 159, PR01358. PR01325. 

30 PR01338, PR01434, PR04333, PRO4302, PRO4430 or PR05727 polypeptide (a) in a test sample of tissue 
cells obtained from the mammal, and (b) in a control sample of known normal tissue ceils of the same cell 
type, wherein a higher or lower expression level in the test sample as compared to the control indicates the 
presence of immune related disease in the mammal from which the test tissue ceils were obtained. 

35 14. A method of diagnosing an immune related disease in a mammal, comprising (a) contacting 

an anti-PRO200, anti-PRO204, anti-PR0212, anti-PR0216. anti-PR0226, anu-PRO240. anti-PR0235, ami- 
PR0245, anti-PR0172, ami-PR0273, anti-PR0272, anti-PR0332, anti-PR0526 t anti-PRO70I, anti-PR0361, 
anti-PR0362, anti-PR0363, anti-PR0364, anti-PR0356, anti-PR0531, anti-PR0533, anti-PRO1083, anti- 
PR0865, anti-PRO770, anti-PR0769, anu-PR0788, anti-PROl 1 14, anri-PRO1007 t anti-PROl 184, anti- 

40 PRO1031, anti-PROl346, anti-PROl 155, anti-PRO1250, anti-PROl3l2, anti-PR01192, anti-PR01246, anti- 
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PR01283. anti-PROI 195, anti-PROl343. anti-PROI4!8, anti-PRO!387, ami-PRO1410, anti-PR0I9I7, anti- 
PROI868, anti-PRO205, anri-PR021, anti-PR0269. anti-PR0344, anti-PR0333, anti-PR0381, anci-PRO720, 
anti-PR0866, anti-PRO840. anti-PR0982. anti-PR0836, anti-PROI 159. anti-PR01358, anri-PROl325, anti- 
PR01338, anti-PRO!434 ? anti-PR04333, anti-PRO4302. anti-PRO4430 or anti-PR05727 antibody with a test 
5 sample of tissue cells obtained from the mammal, and (b) detecting the formation of a complex between the 
antibody and the polypeptide in the test sample. 

15. An immune related disease diagnostic kit. comprising an anti-PRO200 T anu-PRO204, anti- 
PR02I2. anti-PR0216, anti-PR0226. anti-PRO240, anti-PR0235, antt-PR0245, anti-PR0172, anti-PR0273, 

10 ami-PR0272, anti-PR0332. anti-PR0526, anti-PRO70l. anti-PR036l. anti-PR0362, anti-PR0363, anti- 
PR0364, anti-PR0356. anti-PR0531, anti-PR0533, anu-PRO1083, anti-PR0865, anti-PRO770, anti-PR0769, 
anti-PR0788, anti-PROI II 4, anti-PROI 007; anti-PROI 184, anti-PRO103 1. anti-PRO 1 346. anti-PROI 155, 
anti-PROI250, anti-PR013l2. anti-PROI 192, anti-PROl246, ami-PR01283, anti-PROI 195, anti-PR01343, 
anti-PR01418. anti-PROI387. anti-PROI410. anti-PR019l7, anti-PR01868. anti-PRO205. anti-PR021, anti- 

15 PR0269. ami-PR0344. anti-PR0333. anti-PR038I. anti-PRO720. anti-PR0866. anti-PRO840. anti-PR0982, 
anti-PR0836. anti-PROI 159. anti-PROI 358. anti-PRO!325. anti-PR01338. anti-PR01434. anti-PR04333, 
, anti-PRO4302. anti-PRO4430 or anti-PR05727 antibody or fragment thereof and a carrier in suitable 
packaging. 

20 16. The kit of claim 15, further comprising instructions for using the antibody to detect a 

PRO200, PRO204, PR0212. PR0216, PR0226. PRO240. PR0235, PR0245, PR0172, PR0273, PR0272, 
PR0332. PR0526, PRO701. PR0361, PR0362, PR0363. PR0364. PR0356, PR0531, PR0533, PRO 1083, 
PR0865. PRO770. PR0769. PR0788. PRO 1 1 14. PRO1007, PROI 184, PRO1031, PR01346. PROU55, 
PRO1250. PR01312, PRO 1 192. PROI246. PR01283. PROI 195. PR01343. PR01418, PR01387. PROI4I0, 

25 PR01917. PRO 1868. PRO205. PR021, PR0269. PR0344. PR0333. PR0381, PRO720. PR0866. PRO840, 
PR0982. PR0836. PRO! 159. PR01358. PR01325. PROI33S, PROI434. PR04333. PRO4302. PRO4430 or 
PR05727 polypeptide. 

17. An article of manufacture, comprising: 
30 a container; 

an instruction on the container; and 

a composition comprising an active agent contained within the container: wherein the composition is 
effective for inhibiting or reducing an immune response in a mammal, the instruction on the container indicates 
that the composition can be used for treating an immune related disease, and the active agent in the 

35 composition is an agent inhibiting the expression and/or activity of a PRO200, PRO204. PR0212, PR0216, 
PR0226. PRO240, PR0235, PR0245, PR0172, PR0273, PR0272, PR0332, PR0526, PRO701, PR0361, 
PR0362, PR0363, PR0364, PR0356, PR0531, PR0533, PRO1083, PR0865, PRO770, PR0769, PR0788, 
PROH14, PRO1007, PROII84, PRO103I, PROI346, PROI 155, PRO1250, PR01312, PROI 192, PR01246, 
PR01283, PROI 195, PR01343, PROI4I8, PR01387, PRO1410, PR01917, PR01868, PRO205, PR021, 

40 PR0269, PR0344, PR0333. PR0381, PRO720, PR0866, PRO840, PR0982, PR0836, PROI 159, PR01358, 
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PR01325. PR01338. PR01434, PR04333. PRO4302, PRO4430 or PR05727 polypeptide. 

18. The article of manufacture of claim 17 wherein said active agent is an anti-PRO200, anti- 
PRO204. anti-PR0212, anti-PR0216, ami-PR0226, ami-PRO240, anti-PR0235, anti-PR0245, anti-PR0172 t 

5 anti-PR0273, anu-PR0272, anri-PR0332, anti-PR0526, anti-PRO701, anu-PR036l, anu-PR0362, anti- 
PR0363, anti-PR0364, anti-PR0356, anti-PR0531, anti-PR0533, anu-PRO!083, anti-PR0865, anti-PRO770, 
anti-PR0769, anti-PR0788. anti-PROl 1 14, anti-PRO1007, anti-PRO 1 1 84 ; anti-PROl031 t anti-PR01346, 
anti-PROl 155, anti-PROI250, anti-PR013I2, anti-PROl 192, anti-PRO!246. anti-PR01283. anti-PROl 195, 
ami-PR01343, anti-PR014I8, and-PROt387. anti-PROI410, anti-PR019l7, anti-PR01868, anti-PRO205, 

10 anti-PR021, anti-PR0269, anti-PR0344. anti-PR0333, anti-PR038I. anti-PRO720, anti-PR0866, anti- 
PRO840, anti-PR0982, ami-PR0836, anti-PROl 159, anti-PRO!358, anti-PROI325. anti-PROI338, anti- 
PR01434. anti-PR04333, anti-PRO4302, anti-PRO4430 or anti-PR05727 antibody. 

19. A method for identifying a compound capable of inhibiting the expression or activity of a 
15 PRO200. PRO204. PR02I2. PR0216, PR0226. PRO240. PR0235, PR0245. PR0172. PR0273, PR0272. 

. PR0332. PR0526. PRO70I. PR0361, PR0362. PR0363. PR0364. PR0356. PR0531. PR0533. PRO 1083, 
PR0865. PRO770. PR0769. PR0788, PROM 14. PRO1007. PR01I84. PRO103.1. PROl346 ? PROl 155, 
PRO1250. PR01312. PROl 192. PR01246. PR01233, PROl 195, PR01343. PR01418. PR01387. PRO1410. 
PR01917. PR01868, PRO205. PR02I, PR0269, PR0344, PR0333, PR0381. PRO720, PR0866. PRO840, 
20 PR0982. PR0836. PROl 159, PR01358, PR01325, PROI338, PROI434, PR04333. PRO4302, PRO4430 or 
PR05727 polypeptide, comprising contacting a candidate compound with the polypeptide under conditions and 
for a time sufficient to allow these two components to interact. 

20. The method of claim 19. wherein the candidate compound or the PRO200. PRO204, 
25 PR0212. PR0216. PR0226. PRO240, PR0235. PR0245. PR0172. PR0273. PR0272, PR0332. PR0526, 

PRO701. PR0361. PR0362. PR0363. PR0364. PR0356. PR053K PR0533. PRO1083. PR0865. PRO770, 
PR0769. PR0788. PROl 114. PRO1007. PROl 184. PROI031. PR01346. PROl 155. PROI250. PR01312, 
PROl 192. PR01246, PR01283, PROl 195. PROI343, PR01418. PR01387. PRO1410, PROI917, PR01868, 
PRO205. PR021, PR0269, PR0344, PR0333. PR0381, PRO720. PR0866, PRO840, PR0982, PR0836, 
30 PROl 159. PR01358. PR01325. PR01338. PR01434, PR04333, PRO4302, PRO4430 or PR05727 
polypeptide is immobilized on a solid support. 

21. The method of claim 20. wherein the non-immobilized component carries a detectable label. 

35 22. Isolated nucleic acid having at least 80% nucleic acid sequence identity to a nucleotide 

sequence that encodes an amino acid sequence selected from the group consisting of the amino acid sequence 
shown in Figure 1 (SEQ ID NO:l), Figure 3 (SEQ ID NO: 11), Figure 5 (SEQ ED NO: 13), Figure 7 (SEQ ID 
NO:18), Figure 9 (SEQ ID NO:20), Figure 1 1 (SEQ ID N0:25), Figure 13 (SEQ ID NO:30), Figure 15 (SEQ 
ID NO:35), Figure 17 (SEQ ID NO:40) f Figure 19 (SEQ ID NO:45), Figure 21 (SEQ ID NO:50), Figure 23 

40 (SEQ ID NO:56) t Figure 25 (SEQ ID NO:6 1 ), Figure 27 (SEQ ID NO:66), Figure 29 (SEQ ID NO:71), Figure 
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31 (SEQ ID NO:79), Figure 33 (SEQ ID NO:86), Figure 35 (SEQ ID N0:91), Figure 37 (SEQ tD N0:101), 
Figure 39 (SEQ ID NO:106). Figure 41 (SEQ ID NO:Ill). Figure 43 (SEQ ID NO:116), Figure 45 (SEQ ID 
NO: 123), Figure 47 (SEQ ID NO:133), Figure 49 (SEQ ID NO:139), Figure 51 (SEQ ID N0:141), Figure 53 
(SEQ ID NO: 143), Figure 55 (SEQ ID NO: 145). Figure 57 (SEQ ID NO: 147), Figure 59 (SEQ DD NO: 149), 
Figure 61 (SEQ ID NO_:151), Figure 63 (SEQ ID NO:I56), Figure 65 (SEQ ID NO:158), Figure 67 (SEQ ID 
NO:160), Figure 69 (SEQ ID NO:162), Figure 71 (SEQ ID NO:167), Figure 73 (SEQ ID NO:169), Figure 75 
(SEQ ID NO:177), Figure 77 (SEQ ID NO:179), Figure 79 (SEQ ID NO:184), Figure 81 (SEQ ID NO:186), 
Figure 83 (SEQ ID NO: 188), Figure 85 (SEQ ID NO: 190), Figure 87 (SEQ ID NO: 192), Figure 89 (SEQ ID 
NO:228), Figure 91 (SEQ ID NO:230), Figure 93 (SEQ ID NO:232), Figure 95 (SEQ ID NO:240), Figure 97 
(SEQ ID NO:248). Figure 99 (SEQ ID NO:250), Figure 101 (SEQ ID NO:255), Figure 103 (SEQ ID NO:257), 
Figure 105 (SEQ ID NO:266), Figure 107 (SEQ ID N0.268), Figure 109 (SEQ ID NO:270), Figure 1 1 1 (SEQ 
ID NO:272), Figure 113 (SEQ ID NO:274), Figure 115 (SEQ ID NO:276), Figure 117 (SEQ ID NO:278), 
Figure 1 19 (SEQ ID NO:280), Figure 121 (SEQ ID NO:285), Figure 123 (SEQ ID NO:292), Figure 125 (SEQ 
ID N0.294) or Figure 1 27 (SEQ ID NO:296. 

23. Isolated nucleic acid having at least 80% nucleic acid sequence identity to a nucleotide 
sequence selected from the group consisting of the nucleotide sequence shown in Figure I (SEQ ID NO:l), 
Figure 3 (SEQ ID NO: 1 1), Figure 5 (SEQ ID NO: 13), Figure 7 (SEQ ID NO: 18), Figure 9 (SEQ ID NO:20), 
Figure 11 (SEQ ID NO:25), Figure 13 (SEQ ID NO:30), Figure 15 (SEQ ID NO:35), Figure 17 (SEQ ID 
NO:40), Figure 19 (SEQ ID NO:45), Figure 21 (SEQ ID NO:50), Figure 23 (SEQ ID NO:56), Figure 25 (SEQ 
ID NO:6I), Figure 27 (SEQ ID NO:66). Figure 29 (SEQ ID NO:71), Figure 31 (SEQ ID NO:79), Figure 33 
(SEQ ID NO:86), Figure 35 (SEQ ID NO:91), Figure 37 (SEQ ID NO:I01). Figure 39 (SEQ ID NO: 106), 
Figure 41 (SEQ ID NO:lll), Figure 43 (SEQ ID NO:116), Figure 45 (SEQ ID NO:123), Figure 47 (SEQ ED 
NO: 133), Figure 49 (SEQ ID NO: 139), Figure 51 (SEQ ID NO: 141), Figure 53 (SEQ ID NO: 143), Figure 55 
(SEQ ID NO:145). Figure 57 (SEQ ID NO:147), Figure 59 (SEQ ID NO:I49), Figure 61 (SEQ ID N0:151), 
Figure 63 (SEQ ID NO:156), Figure 65 (SEQ ID NO:158), Figure 67 (SEQ ID NO:160), Figure 69 (SEQ ID 
NO:I62), Figure 71 (SEQ ID NO:167), Figure 73 (SEQ ID NO:169). Figure 75 (SEQ ID NO:I77), Figure 77 
(SEQ ID NO: 179), Figure 79 (SEQ ID NO: 184), Figure 81 (SEQ ID NO: 186), Figure 83 (SEQ ID NO: 188). 
Figure 85 (SEQ ID NO: 190), Figure 87 (SEQ ID NO: 192), Figure 89 (SEQ ID NO:228). Figure 91 (SEQ ID 
NO:230), Figure 93 (SEQ ID NO:232), Figure 95 (SEQ ID NO:240), Figure 97 (SEQ ID NO:248), Figure 99 
(SEQ ID NO:250), Figure 101 (SEQ ID NO:255), Figure 103 (SEQ ID NO:257), Figure 105 (SEQ ED 
NO:266), Figure 107 (SEQ ID NO:268), Figure 109 (SEQ ID NO:270), Figure 1 1 1 (SEQ ID NO:272), Figure 
113 (SEQ ID NO:274), Figure 1 15 (SEQ ID NO:276), Figure 117 (SEQ ID NO:278), Figure 119 (SEQ ID 
NO:280), Figure 121 (SEQ ID N0.285), Figure 123 (SEQ ID NO:292), Figure 125 (SEQ ED NO:294) or 
Figure 127 (SEQ ID NO:296. 

24. Isolated nucleic acid having at least 80% nucleic acid sequence identity to a nucleotide 
sequence selected from the group consisting of the full-length coding sequence of the nucleotide sequence 
shown in Figure 1 (SEQ ID NO:l), Figure 3 (SEQ ID NO:l 1), Figure 5 (SEQ ID NO:13), Figure 7 (SEQ ED 
NO:18), Figure 9 (SEQ ED NO:20), Figure 1 1 (SEQ ID NO:25), Figure 13 (SEQ ED NO:30), Figure 15 (SEQ 
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ID N0:35), Figure 17 (SEQ ID NO:40), Figure 19 (SEQ ID NO:45), Figure 21 (SEQ ID NO:50), Figure 23 
(SEQ ID NO:56), Figure 25 (SEQ ID NO:61), Figure 27 (SEQ ID NO:66), Figure 29 (SEQ ID N0:71), Figure 
31 (SEQ ID NO:79), Figure 33 (SEQ ID NO:86) ( Figure 35 (SEQ ID N0:9l), Figure 37 (SEQ ID NO:IOI), 
Figure 39 (SEQ ED NO:106), Figure 41 (SEQ ID NO: 1 1 1), Figure 43 (SEQ ID NO:l 16), Figure 45 (SEQ ID 
5 NO:123), Figure 47 (SEQ ID NO: 133), Figure 49 (SEQ ID NO:139), Figure 51 (SEQ ID NO:141), Figure 53 
(SEQ ID NO: 143), Figure 55 (SEQ ID NO: 145), Figure 57 (SEQ ID NO: 147), Figure 59 (SEQ ID NO: 149), 
Figure 61 (SEQ ID NO: 151), Figure 63 (SEQ ID NO: 156), Figure 65 (SEQ ID NO: 158), Figure 67 (SEQ ID 
NO: 160), Figure 69 (SEQ ID NO: 162), Figure 71 (SEQ ID NO: 167), Figure 73 (SEQ ID NO: 169), Figure 75 
(SEQ ID NO: 177), Figure 77 (SEQ ID NO: 179), Figure 79 (SEQ ID NO: 184), Figure 81 (SEQ ID NO: 186), 

10 Figure 83 (SEQ ID NO: 188), Figure 85 (SEQ ID NO: 190), Figure 87 (SEQ ID NO: 192), Figure 89 (SEQ ID 
NO:228), Figure 91 (SEQ ID NO:230), Figure 93 (SEQ ID NO:232), Figure 95 (SEQ ID NO:240), Figure 97 
(SEQ ID NO:248), Figure 99 (SEQ ID NO:250), Figure 101 (SEQ ID NO:255), Figure 103 (SEQ ID NO:257), 
Figure 105 (SEQ ID NO:266), Figure 107 (SEQ ID NO:268), Figure 109 (SEQ ID NO:270), Figure 1 1 1 (SEQ 
ID NO:272), Figure 113 (SEQ ID NO:274), Figure 115 (SEQ ID NO:276), Figure 117 (SEQ ID NO:278) t 

15 Figure 1 19 (SEQ ID NO:280), Figure 121 (SEQ ID N0:285), Figure 123 (SEQ ID NO:292), Figure 125 (SEQ 
ID NO:294) or Figure 127 (SEQ ID NO:296. 

25. Isolated nucleic acid having at least 80% nucleic acid sequence identity to the full-length 
coding sequence of the DNA deposited under ATCC accession number 209653, 209380, 209254, 209381. 

20 209376, 209260, 209374. 209265, 209419, 209424, 209388, 209433. 209704, 209720, 209621, 209620, 
209616, 209436, 209422, 209702, 209480, 209859, 209774, 209801, 209802, 209849, 209905. 209950, 
209962, 209866, 203128, 209986. 203173, 203132, 203093, 203457, 203244, 203094, 203282. 203276, 

203160, 203277, 203573, 203553, , 209456, 209397, 209492, , 209808. 209802, 209750. 209858, 

203583, 209989, 203092, 203131, 203269. 203267, 203657. 203818, 203834, 23-PTA, 203906. 

25 

26. A vector comprising the nucleic acid of any one of Claims 22 to 25. 

27. The vector of Claim 26 operably linked to control sequences recognized by a host cell 
transformed with the vector. 

30 

28. A host cell comprising the vector of Claim 26. 



29. The host cell of Claim 28, wherein said cell is a CHO cell. 
35 30. The host cell of Claim 28, wherein said cell is an £. colL 

31. The host cell of Claim 28, wherein said cell is a yeast cell. 

32. A process for producing a PRO200, PRO204, PR0212, PR0216, PR0226, PRO240, 
40 PR0235, PR0245, PROI72, PR0273, PR0272, PR0332, PR0526, PRO701, PR036I, PR0362, PR0363, 
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PR0364, PR0356, PR053 1. PR0533, PRO1083, PR0865, PRO770, PR0769, PR0788, PROl 1 14, PRO1007, 
PROl 184, PRO1031, PR01346, PROl 155, PRO1250, PR01312, PROl 192, PR01246 t PR01283, PROl 195, 
PR01343, PR01418, PR01387, PRO1410, PROI917, PR01868, PRO205, PR021, PR0269, PR0344, 
PR0333, PR0381, PRO720, PR0866, PRO840, PR0982, PR0836, PR01159, PR01358, PR01325, 
5 PR01338, PR01434, PR04333, PRO4302, PRO4430 or PR05727 polypeptide comprising culturing the host 
cell of Claim 28 under conditions suitable for expression of said polypeptide and recovering said polypeptide 
from the cell culture. 

33. An isolated polypeptide having at least 80% amino acid sequence identity to an amino acid 

10 sequence selected from the group consisting of the amino acid sequence shown in Figure 2 (SEQ ID NO:2), 
Figure 4 (SEQ ID NO: 12), Figure 6 (SEQ ID NO: 14), Figure 8 (SEQ ID NO: 19), Figure 10 (SEQ ID NO:21), 
Figure 12 (SEQ ID NO:26), Figure 14 (SEQ ID NO:3l), Figure 16 (SEQ ID NO:36), Figure 18 (SEQ ID 
NO:4t), Figure 20 (SEQ ID NO:46), Figure 22 (SEQ ID NO:51), Figure 24 (SEQ ID NO:57), Figure 26 (SEQ 
ID NO:62), Figure 28 (SEQ ID NO:67), Figure 30 (SEQ ID NO:72), Figure 32 (SEQ ID NO:80), Figure 34 

15 (SEQ ID NO:87). Figure 36 (SEQ ID NO:92), Figure 38 (SEQ ID NO: 102), Figure 40 (SEQ ID NO: 107), 
Figure 42 (SEQ ID NO: 1 12). Figure 44 (SEQ ID NO: 117), Figure 46 (SEQ ID NO: 124), Figure 48 (SEQ ID 
NO:I34), Figure 50 (SEQ ID NO:I40), Figure 52 (SEQ ID NO: 142), Figure 54 (SEQ ID NO: 144), Figure 56 
(SEQ ID NO: 146), Figure 58 (SEQ ID NO: 148), Figure 60 (SEQ ID NO: 1 50), Figure 62 (SEQ ID NO: 152), 
Figure 64 (SEQ ID NO: 157), Figure 66 (SEQ ID NO: 159), Figure 68 (SEQ ID NO:l61), Figure 70 (SEQ ID 

20 NO: 163), Figure 72 (SEQ ID NO: 1 68), Figure 74 (SEQ ID NO: 170), Figure 76 (SEQ ID NO: 178), Figure 78 
(SEQ ID NO: 180), Figure 80 (SEQ ID NO: 185), Figure 82 (SEQ ID NO: 1 87), Figure 84 (SEQ ID NO: 189), 
Figure 86 (SEQ ID NO:191), Figure 88 (SEQ ID NO:193), Figure 90 (SEQ ID NO:229), Figure 92 (SEQ ID 
NO:23I), Figure 94 (SEQ ID NO:233), Figure 96 (SEQ ID NO:24l), Figure 98 (SEQ ID NO:249), Figure 100 
(SEQ ID NO:251), Figure 102 (SEQ ID NO:256), Figure 104 (SEQ ID NO:258), Figure 106 (SEQ ID 

25 NO:267), Figure 108 (SEQ ID NO:269), Figure 1 10 (SEQ ID NO:27l), Figure 1 12 (SEQ ID NO:273), Figure 
114 (SEQ ID NO:275), Figure 116 (SEQ ID NO:277), Figure 118 (SEQ ID NO:279), Figure 120 (SEQ ID 
NO:28I), Figure 122 (SEQ ID NO:286), Figure 124 (SEQ ID NO:293), Figure 126 (SEQ ID NO:295) or 
Figure 128 (SEQ ID NO:297). 

30 34. An isolated polypeptide scoring at least 80% positives when compared to an amino acid 

sequence selected from the group consisting of the amino acid sequence shown in Figure 2 (SEQ ID NO:2), 
Figure4 (SEQ ID NO: 12), Figure 6 (SEQ ID NO:14), Figure 8 (SEQ ID NO:l9), Figure 10 (SEQ IDNO:21), 
Figure 12 (SEQ ID NO:26), Figure 14 (SEQ ID NO:3I), Figure 16 (SEQ ID NO:36), Figure 18 (SEQ ID 
NO:41), Figure 20 (SEQ ID NO:46), Figure 22 (SEQ ID NO:51), Figure 24 (SEQ ID NO:57), Figure 26 (SEQ 

35 ID NO:62), Figure 28 (SEQ ID NO:67), Figure 30 (SEQ ID NO:72), Figure 32 (SEQ ID NO:80), Figure 34 
(SEQ ID NO:87), Figure 36 (SEQ ID NO:92), Figure 38 (SEQ ID NO: 102), Figure 40 (SEQ ID NO: 107), 
Figure 42 (SEQ ID NO:l 12), Figure 44 (SEQ ID NO:l 17), Figure 46 (SEQ ID NO: 124), Figure 48 (SEQ ID 
NO:134), Figure 50 (SEQ ID NO:I40), Figure 52 (SEQ ID NO:142), Figure 54 (SEQ ID NO:144), Figure 56 
(SEQ ID NO: 146), Figure 58 (SEQ ID NO: 148), Figure 60 (SEQ ID NO: 150), Figure 62 (SEQ ID NO: 152), 

40 Figure 64 (SEQ ID NO:157), Figure 66 (SEQ ID NO:159), Figure 68 (SEQ CD NO:16l), Figure 70 (SEQ ED 
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NO:163), Figure 72 (SEQ ID NO: 168), Figure 74 (SEQ ID NO:170), Figure 76 (SEQ ID NO: 178), Figure 78 
(SEQ ID NO:180), Figure 80 (SEQ ID N0:185), Figure 82 (SEQ ID NO: 187), Figure 84 (SEQ ID NO:189), 
Figure 86 (SEQ ID NO:19l), Figure 88 (SEQ ID NO:193), Figure 90 (SEQ ID NO:229), Figure 92 (SEQ ID 
NO:231), Figure 94 (SEQ ID NO:233), Figure 96 (SEQ ID NO:24l) t Figure 98 (SEQ ID NO:249), Figure 100 
(SEQ ID NO:251), Figure 102 (SEQ ID NO:256), Figure 104 (SEQ ID NO:258), Figure 106 (SEQ ID 
NO:267) f Figure 108 (SEQ ID NO:269), Figure 1 10 (SEQ ID NO:271), Figure 1 12 (SEQ ID NO:273), Figure 
114 (SEQ ID NO:275), Figure 116 (SEQ ID NO:277), Figure 118 (SEQ ID NO:279), Figure 120 (SEQ ID 
NO:28l), Figure 122 (SEQ ID NO:286), Figure 124 (SEQ ID NO:293), Figure 126 (SEQ ID NO:295) or 
Figure 128 (SEQ ID NO:297). 

35. An isolated polypeptide having at least 80% amino acid sequence identity to an amino acid 
sequence encoded by the full-length coding sequence of the DNA deposited under ATCC accession number 
209653, 209380, 209254, 209381, 209376, 209260, 209374, 209265, 209419, 209424, 209388, 209433, 
209704, 209720. 209621, 209620, 209616, 209436, 209422, 209702. 209480, 209859, 209774. 209801, 
209802. 209849. 209905. 209950, 209962, 209866. 203128, 209986. 203173, 203132. 203093, 203457, 

203244. 203094. 203232. 203276, 203160. 203277. 203573. 203553. . 209456, 209397, 209492, , 

209808. 209802. 209750, 209858, 203583. 209989. 203092, 203131, 203269, 203267, 203657. 203818. 
203834, 23-PTA, 203906. 

20 36. A chimeric moiecuie comprising a polypeptide according to any one of Claims 33 to 35 

fused to a heterologous amino acid sequence. 

37. The chimeric molecule of Claim 36, wherein said heterologous amino acid sequence is an 
epitope tag sequence. 

25 

38. The chimeric molecule of Claim 36. wherein said heterologous amino acid sequence is a Fc 
region of an immunoglobulin. 

39. An antibody which specifically binds to a polypeptide according to any one of Claims 33 to 

30 35. 

40. The antibody of Claim 39, wherein said antibody is a monoclonal antibody, a humanized 
antibody or a single-chain antibody. 

35 41. Isolated nucleic acid having at least 80% nucleic acid sequence identity to: 

(a) a nucleotide sequence encoding the polypeptide shown in Figure 2 (SEQ ID NO:2), Figure 4 
(SEQ IDNO:12), Figure6 (SEQ ID NO:I4), Figure 8 (SEQ ID NO:19), Figure 10 (SEQ ID NO:21) t Figure 12 
(SEQ ID NO:26) t Figure 14 (SEQ ID NO:3l), Figure 16 (SEQ ID NO:36), Figure 18 (SEQ ID NO:41), Figure 
20 (SEQ ID NO:46), Figure 22 (SEQ ID NO:51), Figure 24 (SEQ ID NO:57) t Figure 26 (SEQ ID NO:62) f 

40 Figure 28 (SEQ ID NO:67), Figure 30 (SEQ ID NO:72) t Figure 32 (SEQ ID NO:80), Figure 34 (SEQ ID 
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NO:87), Figure 36 (SEQ ID NO:92), Figure 38 (SEQ ID NO:102), Figure 40 (SEQ ID N0:107) ( Figure 42 
(SEQ ID N0:1 12) t Figure 44 (SEQ ID NO: 117), Figure 46 (SEQ ID NO:I24), Figure 48 (SEQ ID NO:134), 
Figure 50 (SEQ ID NO: 140), Figure 52 (SEQ ID NO:I42), Figure 54 (SEQ ED NO: 144), Figure 56 (SEQ ID 
NO: 146), Figure 58 (SEQ ID NO: 148), Figure 60 (SEQ ID NO: 150), Figure 62 (SEQ ID NO: 152), Figure 64 
5 (SEQ ID NO:157), Figure 66 (SEQ ID NO:159), Figure 68 (SEQ ID NO:161), Figure 70 (SEQ ID NO:163), 
Figure 72 (SEQ ID NO:168), Figure 74 (SEQ ID NO:170), Figure 76 (SEQ ID NO:I78), Figure 78 (SEQ ID 
NO:180), Figure 80 (SEQ ID NO:185), Figure 82 (SEQ ID NO:187), Figure 84 (SEQ ID NO:189), Figure 86 
(SEQ ID NO:191), Figure 88 (SEQ ID NO:193), Figure 90 (SEQ ID NO:229), Figure 92 (SEQ ID NO:231), 
Figure 94 (SEQ ID NO:233), Figure 96 (SEQ ID NO:241), Figure 98 (SEQ ID NO:249), Figure 100 (SEQ ID 
10 NO:251), Figure 102 (SEQ ID NO:256), Figure 104 (SEQ ID NO:258), Figure 106 (SEQ ID NO:267), Figure 
108 (SEQ ID NO:269), Figure 110 (SEQ ID NO:271), Figure 112 (SEQ ID NO:273), Figure 114 (SEQ ID 
NO:275), Figure 1 16 (SEQ ID NO:277), Figure 1 18 (SEQ ID NO:279), Figure 120 (SEQ ID NO;28I), Figure 
122 (SEQ ID NO:286), Figure 124 (SEQ ID NO:293), Figure 126 (SEQ ID NO:295) or Figure 128 (SEQ ID 
NO:297) lacking its associated signal peptide; 

15 (b) a nucleotide sequence encoding an extracellular domain of the polypeptide shown in Figure 

2 (SEQ ID NO:2), Figure 4 (SEQ ID NO: 12), Figure 6 (SEQ ID NO: 14), Figure 8 (SEQ ID NO: 19), Figure 10 
(SEQ ID NO:21). Figure 12 (SEQ ID NO:26), Figure 14 (SEQ ED NO:31), Figure 16 (SEQ ID NO:36), Figure 
18 (SEQ ID NO:41), Figure 20 (SEQ ID NO:46), Figure 22 (SEQ ID NO:51), Figure 24 (SEQ ID NO:57), 
Figure 26 (SEQ ID NO:62), Figure 28 (SEQ ID NO:67), Figure 30 (SEQ ID NO:72), Figure 32 (SEQ ID 

20 NO:80) ; Figure 34 (SEQ ID NO:87) ? Figure 36 (SEQ ID NO:92), Figure 38 (SEQ ID NO: 102), Figure 40 
(SEQ ID NO: 107), Figure 42 (SEQ ID NO:l 12), Figure 44 (SEQ ID NO: 11 7), Figure 46 (SEQ ID NO: 124), 
Figure 48 (SEQ ID NO: 134), Figure 50 (SEQ ID NO: 140), Figure 52 (SEQ ID NO: 1 42), Figure 54 (SEQ ID 
NO:I44), Figure 56 (SEQ ID NO:146), Figure 58 (SEQ ID NO:I48), Figure 60 (SEQ ID NO:150), Figure 62 
(SEQ ID NO:I52), Figure 64 (SEQ ID NO: 157), Figure 66 (SEQ ID NO:I59), Figure 68 (SEQ ID NO:161), 

25 Figure 70 (SEQ ID NO: 163), Figure 72 (SEQ ID NO: 168), Figure 74 (SEQ ID NO: 170), Figure 76 (SEQ ID 
NO: 178), Figure 78 (SEQ ID NO: 180), Figure 80 (SEQ ID NO: 185), Figure 82 (SEQ ID NO: 187), Figure 84 
(SEQ ID NO:I89), Figure 86 (SEQ ID NO:I91), Figure 88 (SEQ ID NO:193), Figure 90 (SEQ ID NO:229), 
Figure 92 (SEQ ID NO:231), Figure 94 (SEQ ID NO:233), Figure 96 (SEQ ID NO:241), Figure 98 (SEQ ID 
NO:249), Figure 100 (SEQ ID NO:251), Figure 102 (SEQ ID NO:256), Figure 104 (SEQ ID NO:258), Figure 

30 106 (SEQ ID NO:267), Figure 108 (SEQ ID NO:269), Figure 110 (SEQ ID NO:27i), Figure 112 (SEQ ID 
N0273), Figure 1 14 (SEQ ID NO:275), Figure 1 16 (SEQ ID NO:277), Figure 1 18 (SEQ ID NO:279), Figure 
120 (SEQ ID NO:281), Figure 122 (SEQ ID NO:286), Figure 124 (SEQ ID NO:293), Figure 126 (SEQ ID 
NO:295) or Figure 128 (SEQ ID NO:297) with its associated signal peptide; or 

(c) a nucleotide sequence encoding an extracellular domain of the polypeptide shown in Figure 2 

35 (SEQ ID NO:2), Figure 4 (SEQ ID NO: 12), Figure 6 (SEQ ID NO: 14), Figure 8 (SEQ ID NO: 19), Figure 10 
(SEQ ID NO:21) t Figure 12 (SEQ ID NO:26), Figure 14 (SEQ ID NO:31), Figure 16 (SEQ ID NO:36), Figure 
18 (SEQ ID NO:41), Figure 20 (SEQ ID NO:46), Figure 22 (SEQ ID NO:51), Figure 24 (SEQ ID NO:57), 
Figure 26 (SEQ ID NO:62), Figure 28 (SEQ ID NO:67), Figure 30 (SEQ ID NO:72), Figure 32 (SEQ ID 
NO:80) t Figure 34 (SEQ ID NO:87), Figure 36 (SEQ ID NO:92), Figure 38 (SEQ ID NO: 102), Figure 40 

40 (SEQ ID NO:107), Figure 42 (SEQ ID NO:l 12), Figure 44 (SEQ ED NO:l 17), Figure 46 (SEQ ID NO:124), 
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Figure 48 (SEQ ID NO: 134), Figure 50 (SEQ ID NO: 140), Figure 52 (SEQ ID NO: 142), Figure 54 (SEQ ID 
NO: 144), Figure 56 (SEQ ID NO: 146), Figure 58 (SEQ ID NO: 148), Figure 60 (SEQ ID NO: 150), Figure 62 
(SEQ ID NO:152), Figure 64 (SEQ ID NO:157), Figure 66 (SEQ ID NO:159), Figure 68 (SEQ ID NO:161), 
Figure 70 (SEQ ID NO: 163), Figure 72 (SEQ ID NO: 168), Figure 74 (SEQ ID NO: 170), Figure 76 (SEQ ID 
5 NO:178), Figure 78 (SEQ ID NO:180), Figure 80 (SEQ ID NO:185), Figure 82 (SEQ ED NO:187), Figure 84 
(SEQ ID NO:189), Figure 86 (SEQ ID NO:191), Figure 88 (SEQ ID NO: 193), Figure 90 (SEQ ID NO:229), 
Figure 92 (SEQ ID NO:231), Figure 94 (SEQ ID NO:233), Figure 96 (SEQ ID NO:241), Figure 98 (SEQ ID 
NO:249), Figure 100 (SEQ ID NO:25I), Figure 102 (SEQ ID NO:256), Figure 104 (SEQ ID NO:258), Figure 
106 (SEQ ID NO:267), Figure 108 (SEQ ID NO:269), Figure 110 (SEQ ID NO:271), Figure 112 (SEQ ID 
10 NO:273), Figure 1 14 (SEQ ID NO:275), Figure 1 16 (SEQ ID NO:277), Figure 1 18 (SEQ ID NO:279), Figure 
120 (SEQ ID NO:281), Figure 122 (SEQ ID NO:286), Figure 124 (SEQ ID NO:293), Figure 126 (SEQ ID 
NO:295) or Figure 128 (SEQ ID NO:297) lacking its associated signal peptide. 

42. An isolated polypeptide having at least 80% amino acid sequence identity to: 

15 ( a ) th c polypeptide shown in Figure 2 (SEQ ID NO:2), Figure 4 (SEQ ID NO: 12), Figure 6 

(SEQ ID NO: 14), Figure 8 (SEQ ID NO: 19), Figure 10 (SEQ ID NO:2i), Figure 12 (SEQ ID NO:26), Figure 
14 (SEQ ID NO:3i), Figure 16 (SEQ ID NO:36), Figure 18 (SEQ ID NO:41), Figure 20 (SEQ ID NO:46) f 
Figure 22 (SEQ ID NO:51), Figure 24 (SEQ ID NO:57), Figure 26 (SEQ ID NO:62), Figure 23 (SEQ ID 
NO:67), Figure 30 (SEQ ID NO:72), Figure 32 (SEQ ID NO:80), Figure 34 (SEQ ID NO:87), Figure 36 (SEQ 

20 ID NO:92), Figure 38 (SEQ ID NO: 102), Figure 40 (SEQ ID NO: 107), Figure 42 (SEQ ID NO:l 12), Figure 44 
(SEQ ID NO:I17), Figure 46 (SEQ ID NO:124), Figure 48 (SEQ ID NO:134), Figure 50 (SEQ ID NO:140), 
Figure 52 (SEQ ID NO: 142), Figure 54 (SEQ ID NO: 144), Figure 56 (SEQ ID NO: 146), Figure 58 (SEQ ID 
NO: 148), Figure 60 (SEQ ID NO: 150), Figure 62 (SEQ ID NO: 1 52), Figure 64 (SEQ ID NO: 157), Figure 66 
(SEQ ID NO:159), Figure 68 (SEQ ID NO:16I), Figure 70 (SEQ ID NO:163), Figure 72 (SEQ ID NO:l68), 

25 Figure 74 (SEQ ID NO: 1 70), Figure 76 (SEQ ID NO: 178), Figure 78 (SEQ ID NO: 180), Figure 80 (SEQ ID 
NO: 185), Figure 82 (SEQ ID NO: 1 87), Figure 84 (SEQ ID NO: 1 89), Figure 86 (SEQ ID NO: 191), Figure 88 
(SEQ ID NO: 193), Figure 90 (SEQ ID NO:229), Figure 92 (SEQ ID NO:23I), Figure 94 (SEQ ID NO:233), 
Figure 96 (SEQ ID NO:241), Figure 98 (SEQ ID NO:249), Figure 100 (SEQ ID NO:25i), Figure 102 (SEQ ID 
NO:256), Figure 104 (SEQ ID NO:258), Figure 106 (SEQ ID NO:267), Figure 108 (SEQ ID NO:269), Figure 

30 1 10 (SEQ ID NO:271), Figure 1 12 (SEQ ID NO:273), Figure 1 14 (SEQ ID NO:275), Figure 1 16 (SEQ ID 
NO:277), Figure 1 18 (SEQ ID NO:279), Figure 120 (SEQ ID NO:281), Figure 122 (SEQ ID NO:286), Figure 
124 (SEQ ID NO:293), Figure 126 (SEQ ID NO:295) or Figure 128 (SEQ ID NO:297), lacking its associated 
signal peptide; 

(b) an extracellular domain of the polypeptide shown in Figure 2 (SEQ ED NO:2), Figure 4 (SEQ 
35 ID NO: 12), Figure 6 (SEQ ED NO: 14), Figure 8 (SEQ ID NO: 19), Figure 10 (SEQ ID NO:21), Figure 12 (SEQ 
ID NO:26), Figure 14 (SEQ ID NO:3I), Figure 16 (SEQ ID NO:36), Figure 18 (SEQ ID NO:41), Figure 20 
(SEQ ED NO:46), Figure 22 (SEQ ID NO:51), Figure 24 (SEQ ID NO:57), Figure 26 (SEQ ID NO:62), Figure 
28 (SEQ ID NO:67), Figure 30 (SEQ ID NO:72), Figure 32 (SEQ ID NO:80), Figure 34 (SEQ ED NO:87) F 
Figure 36 (SEQ ID NO:92), Figure 38 (SEQ ID NO:102), Figure 40 (SEQ ID NO:107), Figure 42 (SEQ ED 
40 NO: 112), Figure 44 (SEQ ID NO: 1 17), Figure 46 (SEQ ID NO: 124), Figure 48 (SEQ ID NO: 134), Figure 50 
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(SEQ CD NO: 140), Figure 52 (SEQ ID NO: 142), Figure 54 (SEQ II* NO: 144), Figure 56 (SEQ ID NO: 146), 
Figure 58 (SEQ ID NO: 148), Figure 60 (SEQ ID NO: 150), Figure 62 (SEQ ID NO: 152), Figure 64 (SEQ ID 
NO:l57), Figure 66 (SEQ ID NO:159), Figure 68 (SEQ ID NO:161), Figure 70 (SEQ ID N0:163), Figure 72 
(SEQ ID NO:168), Figure 74 (SEQ ID NO: 170), Figure 76 (SEQ ID NO: 178), Figure 78 (SEQ ID NO: 180), 
5 Figure 80 (SEQ ED N0:185), Figure 82 (SEQ ID NO: 187), Figure 84 (SEQ ID NO:189), Figure 86 (SEQ CD 
NO:191), Figure 88 (SEQ ID NO:193), Figure 90 (SEQ ID NO:229), Figure 92 (SEQ ID NO:231), Figure 94 
(SEQ ID NO:233), Figure 96 (SEQ ID NO:241), Figure 98 (SEQ ID NO:249), Figure 100 (SEQ ID NO:251), 
Figure 102 (SEQ ID NO:256), Figure 104 (SEQ ID NO:258), Figure 106 (SEQ ID NO:267), Figure 108 (SEQ 
ID NO:269), Figure 110 (SEQ ID NO:27l), Figure 112 (SEQ ID NO:273), Figure 114 (SEQ ID NO:275), 

10 Figure 1 16 (SEQ ID NO:277), Figure 1 18 (SEQ ID NO:279), Figure 120 (SEQ ID NO:281), Figure 122 (SEQ 
ID NO:286), Figure 124 (SEQ ID NO:293), Figure 126 (SEQ ID NO:295) or Figure 128 (SEQ ID NO:297), 
with its associated signal peptide; or 

(c) an extracellular domain of the polypeptide shown in Figure 2 (SEQ ID NO:2), Figure 4 (SEQ 
ID NO: 12), Figure 6 (SEQ ID NO: 14), Figure 8 (SEQ ID NO: 19), Figure 10 (SEQ ID NO:21), Figure 12 (SEQ 

15 ID NO:26), Figure 14 (SEQ ID NO:31), Figure 16 (SEQ ID NO:36), Figure 18 (SEQ ID NO:41), Figure 20 
(SEQ ID NO:46), Figure 22 (SEQ ID NO: 51), Figure 24 (SEQ ID NO:57), Figure 26 (SEQ ID NO:62), Figure 
28 (SEQ ID NO:67), Figure 30 (SEQ ID NO:72), Figure 32 (SEQ ID NO:80), Figure 34 (SEQ ID NO:87), 
Figure 36 (SEQ ID NO:92), Figure 38 (SEQ ID NO:102), Figure 40 (SEQ ID NO: 107), Figure 42 (SEQ ID 
NO: 1 12), Figure 44 (SEQ ID NO:l 17), Figure 46 (SEQ ID NO: 1 24), Figure 48 (SEQ ID NO: 134), Figure 50 

20 (SEQ ID NO: 140), Figure 52 (SEQ ID NO: 142), Figure 54 (SEQ ID NO: 144), Figure 56 (SEQ ID NO: 146), 
Figure 58 (SEQ ID NO: 148), Figure 60 (SEQ ID NO: 150), Figure 62 (SEQ ID NO: 152), Figure 64 (SEQ ID 
NO:157), Figure 66 (SEQ ID NO: 159), Figure 68 (SEQ ID NO: 161), Figure 70 (SEQ ID NO: 163), Figure 72 
(SEQ ID NO: 168), Figure 74 (SEQ ID NO: 1 70), Figure 76 (SEQ ID NO: 1 78), Figure 78 (SEQ ID NO: 180), 
Figure 80 (SEQ ID NO:185), Figure 82 (SEQ ID NO:187), Figure 84 (SEQ ID NO:189), Figure 86 (SEQ ID 

25 NO: 19 1), Figure 88 (SEQ ID NO: 193), Figure 90 (SEQ ID NO:229), Figure 92 (SEQ ID NO:231), Figure 94 
(SEQ ID NO:233), Figure 96 (SEQ ID NO:241), Figure 98 (SEQ ID NO:249), Figure 100 (SEQ ID NO:25I), 
Figure 102 (SEQ ID NO:256), Figure 104 (SEQ ID NO:258), Figure 106 (SEQ ID NO:267), Figure 108 (SEQ 
ID NO:269), Figure 1 10 (SEQ ID NO:271), Figure 112 (SEQ ID NO:273), Figure 114 (SEQ ID NO:275), 
Figure 1 16 (SEQ ID NO:277), Figure 1 18 (SEQ ID NO:279), Figure 120 (SEQ ID NO:281), Figure 122 (SEQ 

30 ID NO:286), Figure 124 (SEQ ID NO:293), Figure 126 (SEQ ED NO:295) or Figure 128 (SEQ ID NO:297), 
lacking its associated signal peptide. 

43. A method of affecting the proliferation of T-cells comprising contacting PBMC cells with an 
effective amount of a PRO200, PRO204, PR0212, PR0216, PR0226, PRO240, PR0235, PR0245, PR0172, 

35 PR0273, PR0272, PR0332, PR0526, PRO701, PR0361, PR0362, PR0363, PR0364, PR0356, PR0531, 
PR0533, PRO1083, PR0865, PRO770, PR0769, PR0788, PROII14, PRO1007, PR01184, PRO1031, 
PR01346, PR01155, PRO1250, PR01312, PR01192, PR01246, PRO 1283, PR01195, PR01343, PR01418, 
PR01387, PRO1410, PR01917, PR01868, PRO205, PR021, PR0269, PR0344, PR0333, PR0381, PRO720, 
PR0866, PRO840, PR0982, PR0836, PR01159, PR01358, PR01325, PR01338, PR01434, PR04333, 

40 PRO4302, PRO4430 or PR05727 polypeptide and measuring the change in proliferation from control levels. 
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44. A method of affecting vascular permeability comprising injecting a test animal with an 
effective amount of a PRO200, PRO204, PR0212, PR0216, PR0226, PRO240, PR0235, PR0245, PR0172, 
PR0273, PR0272, PR0332, PR0526, PRO701, PR036I, PR0362, PR0363, PR0364, PR0356, PR0531, 
5 PR0533, PRO1083, PR0865, PRO770, PR0769, PR0788, PROH14 t PRO1007, PROl 184, PRO1031, 
PR01346, PR01155, PRO1250, PR01312, PROl 192, PR01246, PR01283, PROl 195, PR01343, PR01418, 
PR01387, PRO1410, PR01917, PR01868, PRO205, PR021, PR0269, PR0344, PR0333, PR0381, PRO720, 
PR0866, PRO840, PR0982, PR0836, PROl 159, PR01358, PR01325, PR01338, PR01434, PR04333, 
PRO4302, PRO4430 or PR05727 polypeptide, and measuring the resulting extent of vascular permeability. 
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Figure 1 

CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCTGGTTCAGGTC 
CAGGTTTTGCTTTGATCCTTTTCAAAAACTGGAGACACAGAAGAGGGCTCTAGGAAAAAG 
TTTTGGATGGGATTATGTGGAAACTACCCTGCGATTCTCTGCTGCCAGAGCAGGCTCGGC 
GCTTCCACCCCAGTGCAGCCTTCCCCTGGCGGTGGTGAAAGAGACTCGGGAGTCGCTGCT 
TCCAAAGTGCCCGCCGTGAGTGAGCTCTCACCCCAGTCAGCCAA 

ATGA GCCTCTTCGGGCTTCTCCTGCTGACATCTGCCCTGGCCGGCCAGAGACAGGGGACT 
CAGGCGGAATCCAACCTGAGTAGTAAATTCCAGTTTTCCAGCAACAAGGAACAGAACGGA 
GTACAAGATCCTCAGCATGAGAGAATTATTACTGTGTCTACTAATGGAAGTATTCACAGC 
CCAAGGTTTCCTCATACTTATCCAAGAAATACGGTCTTGGTATGGAGATTAGTAGCAGTA 
GAGGAAAATGTATGGATACAACTTACGTTTGATGAAAGATTTGGGCTTGAAGACCCAGAA 
GATGACATATG CAAGTATGATTTTGTAGAAGTTGAGGAACC CAGTGATGGAACTATATTA 
GGGCGCTGGTGTGGTTCTGGTACTGTACCAGGAAAACAGATTTCTAAAGGAAATCAAATT 
AGGATAAGATTTGTATCTGATGAATATTTTCCTTCTGAACCAGGGTTCTGCATCCACTAC 
AACATTGTCATGCCACAATTCACAGAAGCTGTGAGTCCTTCAGTGCTACCCCCTTCAGCT 
TTGCCACTGGACCTGCTTAATAATGCTATAACTGCCTTTAGTACCTTGGAAGACCTTATT 
CGATAT CTTGAAC CAGAGAGATGGCAGTTGGACTTAGAAGATCTATATAGGCCAACTTGG 
CAACTTCTTGGCAAGGCTTTTGTTTTTGGAAGAAAATCCAGAGTGGTGGATCTGAACCTT 
CTAACAGAGGAGGTAAGATTATACAGCTGCACACCTCGTAACTTCTCAGTGTCCATAAGG 
GAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGT 
GGTGGGAACTGTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAA 
GTTACTAAAAAATACCACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTCAGGGGATTG 
CACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGAGTGTGACTGTGTGTGCAGA 
GGGAGCACAGGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGC 
AGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGTTTGC 
TTC^GGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAG 
AATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCT 
TCAATCGTGGAAAGAAAATTAAATGTTGTATTAAATAGATCACCAGCTAGTTTCAGAGTT 
ACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTT 

CGATACGG CTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCAC CTGATTC 
CGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGCCTAAAATCGTATAAAATCTGGA 

AACCTGGTTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCATGCTGATAGG 
ACAGACTGGATTTTTCATATTTCTTATTAAAATTTCTGCCATTTAGAAGAAGAGAACTAC 
ATTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTTATCTTCACTTTATCGA 
TAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTATATTCTCCTTTTGACATTATAACT 
GTTGGCTTTTCTAATCTTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATTCTTT 
TTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATTTTTCTAAACACAATTGTTA 
TAGCC^GAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTC 
ATTCTCGTATGGTGCTAGAGTTAGATTAATCTGCATTTrAAAAAACTGAATTGGAATAGA 
ATTGGTAAGTTGCAAAGACTTTTTGAAAATAATTAAATTATCATATCTTC CATTC CTGTT 
ATTGGAGATGAAAATAAAAAGCAAGTTATGAAAGTAGACATTCAGATCCAGCCATTACTA 
ACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTGAAA 
AAGACTTGGCAGCTTCCTGATAAAGCGTGCTGTGCTGTGCAGTAGGAACACATCCTATTT 
ATTGTGATGTTGTGGTTTTATTATCTTAAACTCTGTTCCATACACTTGTATAAATACATG 
GATATTTTTATGTACAGAAGTATGTCTCTTAACCAGTTCACTTATTGTACTCTGGCAATT 
TAAAAGAAAATCAGTAAAATATTTTGCTTGTAAAATGCTTAATATNGTGCCTAGGTTATG 
TGGTGACTATTTGAATCAAAAATGTATTGAATCATCAAATAAAAGAATGTGGCTATTTTG 
GGGAGAAAATTAAAAAAAATiAAAAAAAAAAAAGGTTTAGGGATAACAGGGTAATGCGGCC 
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Figure 2 

MSLFGLLLLTSALAGQRQGTQAESNLSSKFQFSSNKEQNGVQDPQHERIITVSTNGSfflS 

PRFPHTYPRNTVLVWRLVAVEENVWIQLTFDERFGLEDPEDDICKYDFVEVEEPSDGTIL 

GRWCGSGTVPGKQISKGNQIRIRFVSDEVFPSEPGFCIHYNIVMPQFTEAVSPSVLPPSA 

LPLDLLNNAITAFSTLEDLIRYLEPERWQLDLEDLYRPTWQLLGKAFVFGRKSRVVDLNL 

LTEEVRLYSCTPRNFSVSIREELKRTDTIFWPGCLLVKRCGGNCACCLHNCNECQCVPSK 

VTKKYHEVLQLRPKTGVRGLHKSLTDVALEHHEECDCVCRGSTGG 
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Figure 3 

' TGCCGGGCTGCGGGGCGCCTTGACTCTCCCTCCACCCTGCCTCCTCGGGCTCCACTCGTC 
TGCCCCTGGACTCCCGTCTCCTCCTGTCCTCCGGCTTCCCAGAGCTCCCTCCTTATGGCA 
GCAGCTTCCCGCGTCTCCGGCGCAGCTTCTCAGCGGACGACCCTCTCGCTCCGGGGCTGA 
GCCCAGTCCCTGGATGTTGCTGAAACTCTCGAGATCATGCGCGGGTTTGGCTGCTGCTTC 
CCCGCCGGGTGCCACTGCCACCGCCGCCGCCTCTGCTGCCGCCGTCCGCGGGATGCTCAG 
TAGCCCGCTGCCCGGCCCCCGCGATCCTGTGTTCCTCGGAAGCCGTTTGCTGCTGCAGAG 
TTGCACGAACTAGTC 

ATGGTGCTGTGGGAGTCCCCGCGGCAGTGCAGCAGCTGGACACTTTGCGAGGGCTTTTGC 

TGGCTGCTGCTGCTGCCCGTCATGCTACTCATCGTAGCCCGCCCGGTGAAGCTCGCTGCT 

TTCCCTACCTCCTTAAGTGACTGCCAAACGCCCACCGGCTGGAATTGCTCTGGTTATGAT 

GACAGAGAAAATGATCTCTTCCTCTGTGACACCAACACCTGTAAATTTGATGGGGAATGT 

TTAAGAATTGGAGACACTGTGACTTGCGTCTGTCAGTTCAAGTGCAACAATGACTATGTG 

CCTGTGTGTGGCTCCAATGGGGAGAGCTACCAGAATGAGTGTTACCTGCGACAGGCTGCA 

TGCAAACAGCAGAGTGAGATACTTGTGGTGTCAGAAGGATCATGTGCCACAGATGCAGGA 

TCAGGATCTGGAGATGGAGTCCATGAAGGCTCTGGAGAAACTAGTCAAAAGGAGACATCC 

ACCTGTGATATTTGCCAGTTTGGTGCAGAATGTGACGAAGATGCCGAGGATGTCTGGTGT 

GTGTGTAATATTGACTGTTCTCAAACCAACTTCAATCCCCTCTGCGCTTCTGATGGGAAA 

TCTTATGATAATGCATGCCAAATCAAAGAAGCATCGTGTCAGAAACAGGAGAAAATTGAA 

GTCATGTCTTTGGGTCGATGTCAAGATAACACAACTACAACTACTAAGTCTGAAGATGGG 

CATTATGCAAGAACAGATTATGCAGAGAATGCTAACAAATTAGAAGAAAGTGCCAGAGAA 

CACCACATACCTTGTCCGGAACATTACAATGGCTTCTGCATGCATGGGAAGTGTGAGCAT 

TCTATCAATATGCAGGAGCCATCTTGCAGGTGTGATGCTGGTTATACTGGACAACACTGT 

GAAAAAAAGGACTACAGTGTTCTATACGTTGTTCCCGGTCCTGTACGATTTCAGTATGTC 

TTAATCGCAGCTGTGATTGGAACAATTCAGATTGCTGTCATCTGTGTGGTGG 

TCCTCTGCATCACAAGGAAATGCCCCAGAAGCAACAGAATTCACAGACAGAAGCAAAATA 

C^GGGCACTACAGTTCAGACAATACAACAAGAGCGTCCACGAGGTTAATC TAAA GGGAGC 

ATGTTTCACAGTGGCTGGACTACCGAGAGCTTGGACTACACAATACAGTATTATAGACAA 

AAGAATAAGACAAGAGATCTACACATGTTGCCTTGCATTTGTGGTAATCTACACCAATGA 

AAACATGTACTACAGCTATATTTGATTATGTATGGATATATTTGAAATAGTATACATTGT 

CTTGATGTTTTTTCTGTAATGTAAATAAACTATTTATATCACACAATATAGTTTTTTCTT 

TCCCATGTATTTGTTATATATAATAAATACTCAGTGATGAG 
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Figure 4 

MVLWESPRQCSSWTLCEGFCWLLLLPVMLLIVARPVKLAAFPTSLSDCQTPTGWNCSGY 
DDRENDLFLCDTNTCKFDGECLRIGDTVTCVCQFKCNNDYVPVCGSNGESYQNECYLRQ 
AACKQQSEILWSEGSCATDAGSGSGDGVHEGSGETSQKETSTCDICQFGAECDEDAED 
VWCVCNIDCSQTNFNPLCASDGKSYDNACQIKEASCQKQEKIEVMSLGRCQDNTTTTTK 
SEDGHYARTDYAENANKLEESAREHHI PCPEHYNGFCMHGKCEHS INMQEPSCRCDAGY 
TGQHCEKKDYSVLYVVPGPVRFQYVLIAAVIGTIQIAVICVVVLCITRKCPRSNRIHRQ 
KQNTGHYS S DNTTRASTRL I 
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Figure 5 

tccgcaggcggaccgggggcaaaggaggtggcatgtcggtcaggcacagcagggtcctgt 

gtccgcgctgagccgcgctctccctgctccagcaXggacc 

atgagggcgctggaggggccaggcctgtcgctgctgtgcctggtgttggcgctgcctgcc 

ctgctgccggtgccggctgtacgcggagtggcagaaacacccacctacccctggcgggac 

gcagagacaggggagcggctggtgtgcgcccagtgccccccaggcacctttgtgcagcgg 

ccgtgccgccgagacagccccacgacgtgtggcccgtgtccaccgcgccactacacgcag 

ttctggaactacctggagcgctgccgctactgcaacgtcctctgcggggagcgtgaggag 

gaggcacgggcttgccacgccacccacaaccgtgcctgccgctgccgcaccggcttcttc 

GCGCACGCTGGTTTCTGCTTGGAGCACGCATCGTGTCCACCTGGTGCCGGCGTGATTGCC 
CCGGGCACCCCCAGCCAGAACACGCAGTGCCAGCCGTGCCCCCCAGGCACCTTCTCAGCC 
AGCAGCTCCAGCTCAGAGCAGTGCCAGCCCCACCGCAACTGCACGGCCCTGGGCCTGGCC 
CTCAATGTGCCAGGCTCTTCCTCCCATGACACCCTGTGCACCAGCTGCACTGGCTTCCCC 
CTCAGCACCAGGGTACCAGGAGCTGAGGAGTGTGAGCGTGCCGTCATCGACTTTGTGGCT 
TTCCAGGACATCTCCATCAAGAGGCTGCAGCGGCTGCTGCAGGCCCTCGAGGCCCCGGAG 
GGCTGGGGTCCGACACCAAGGGCGGGCCGCGCGGCCTTGCAGCTGAAGCTGCGTCGGCGG 
CTCACGGAGCTCCTGGGGGCGCAGGACGGGGCGCTGCTGGTGCGGCTGCTGCAGGCGCTG 
CGCGTGGCCAGGATGCCCGGGCTGGAGCGGAGCGTCCGTGAGCGCTTCCTCCCTGTGCAC 
TGATCCTGGCCCCCTCTTATTTATTCTACATCCTTGGCACCCCACTTGCACTGAAAGAGG 
CTTTTTTTTAAATAGAAGAAATGAGGTTTCTTAAAAAAAAAAAAAAAAAAAAAA 
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Figure 6 

MRALEGPGLSLLCLVLALPALLPVPAVRGVAETPTYPWRDAETGERLVCAQCPPGTFVQ 
RPCRRDSPTTCGPCPPRHYTQFWNYLERCRYCNVLCGEREEEARACHATHNRACRCRTG 
FFAHAGFCLEHASCPPGAGVIAPGTPSQNTQCQPCPPGTFSASSSSSEQCQPHRNCTAL 
GLALNVPGSSSHDTLCTSCTGFPLSTRVPGAEECERAVIDFVAFQDISIKRLQRLLQAL 
EAPEGWGPTPRAGRAALQLKLRRRLTELLGAQDGALLVRLLQALRVARMPGLERSVRER 
FLPVH 
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Figure 7 

CACAAGCATCTTAATTTGAATCGACAAAGTTT 

AATTCAACCCGAGTGTTTTCCAAGAAGATTGTATTTGCTTAAATTGCTACAGTAATTCAA 
GAGACAGCCCTGTCTGGACACAGAGTTACTGTGGATTTTTAAGAGACTCAGTTAAAGAAT 
TTAGGAATTTCTGATTCATTTAAAGGATTTACAAATTCATCAACCCCTGAAAACTAAAGC 
AAATTGAACAGGAAAAAAAAAAAGAAG 

ATGGGTTTTTTAAGTCCAATATATGTTATTTTCTTCTTTTTTGGAGTCAAAGTACATTGC 

CAATATGAAACTTATCAGTGGGATGAAGACTATGACCAAGAGCCAGATGATGATTACCAA 

ACAGGATTCCCATTTCGTCAAAATGTAGAGTACGGAGTTCCTTTTCATCAGTATACTTTA 

GGCTGTGTCAGTGAATGCTTCTGTCCAACTAACTTTCCATCATCAATGTACTGTGATAAT 

CGCAAACTCAAGACTATCCCAAATATTCCGATGCACATTCAGCAACTCTACCTTCAGTTC 

AATGAAATTGAGGCTGTGACTGCAAATTCATTCATCAATGCAACTCATCTTAAAGAAATT 

AACCTCAGCCACAACAAAATTAAATCTCAAAAGATTGATTATGGTGTGTTTGCTAAGCTT 

CCAAATCTACTACAACTTCATCTAGAGCATAATAATTTAGAAGAATTTCCATTTCCTCTT 

CCTAAATCTCTGGAAAGACTCCTTCTTGGTTACAATGAAATCTCCAAACTGCAGACAAAT 

GCTATGGATGGGCTAGTAAACTTGACCATGCTTGATCTCTGTTATAATTATCTTCATGAT 

TCTCTGCTAAAAGACAAAATCTTTGCCAAAATGGAAAAACTAATGCAGCTCAACCTCTGC 

AGTAACAGATTAGAATCAATGCCTCCTGGTTTGCCTTCTTCACTTATGTATCTGTCTTTA 

GAAAATAATTCAATTTCTTCTATACCCGAAAAATACTTCGACAAACTTCCAAAACTTCAT 

ACTCTAAGAATGTCACACAACAAACTACAAGACATCCCATATAATATTTTTAATCTTCCC 

AACATTGTAGAACTCAGTGTTGGACACAACAAATTGAAGCAAGCATTCTATATTCCAAGA 

AATTTGGAACACCTATAC CTACAAAATAATGAAATAGAAAAGATGAAT CTTACAGTGATG 

TGTCCTTCTATTGACCCACTACATTACCACCATTTAACATACATTCGTGTGGACCAAAAT 

AAACTAAAAGAACCAATAAGCTCATACATCTTCTTCTG CTTCC CTCATATACACACTATT 

TATTATGGTGAACAACGAAGCACTAATGGTCAAACAATACAACTAAAGACACAAGTTTTC 

AGGAGATTTCCAGATGATGATGATGAAAGTGAAGATCACGATGATCCTGACAATGCTCAT 

GAGAGCCCAGAACAAGAAGGAGCAGAAGGGCACTTTGACCTTCATTATT 

ATGAAAATCAAGAATAGCAAGAAACTATATAGGTATACACTTACGACTTCACAAAACCTA 

TACTTAATATAGTAAATCTAAGTAAACATGTATTACTCAAAGTAATATATTTAGAATTAT 

GTATTAGTATAAGATCAGAATTGAATTTAAGTTGTTGGTGACATCTGCATCATTTCATAG 

GATTAGAACTTACTCAAAATAATGTAAATCTTTAAAAATATAAATTAGAATGACAAGTGG 

GAATCATAAATTAAACGTTAATGGTTTCTTATGCTCTTTTTAAATATAGAAATATCATGT 

TAAAGAAAAAAAAAAAAAA 
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Figure 8 

MGFLSPIYVIFFFFGVKVHCQYETYQWDEDYDQEPDDDYQTGFPFRQNVDYGVPFHQYTL 
GCVSECFCPTNFPSSMYCDNRKLKTIPNIPMHIQQLYLQFNEIEAVTANSFINATHLKEI 
NLSHNKIKSQKIDYGVFAKLPNLLQLHLEHNNLEEFPFPLPKSLERLLLGYNEISKLQTN 
AMDGLVNLTMLDLCYNYLHDSLLKDKI FAKMEKLMQLNLCSNRLESMPPGLPSSLMYLSL 
ENNSISSIPEKYFDKLPKLHTLRMSHNKLQDIPYNIFNLPNIVELSVGHNKLKQAFYIPR 
NLEHLYLQNNEI EKMNLTVMCP S I DPLHYHHLTYI RVDQNKLKE P I SS YI FFCFPHIHTI 
YYGEQRSTNGQTIQLKTQVFRRFPDDDDESEDHDDPDNAHESPEQEGAEGHFDLHYYENQ 

E 
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Figure 9 

CCCAAGCCAGCCGAGCCGCCAGAGCCGCGGGCCGCGGGGGTGTCGCGGGCCCAACCCCAG 
G 

ATGCTCCCCTGCGCCTCCTGCCTACCCGGGTCTCTACTGCTCTGGGCGCTGCTACTGTTG 
CTCTTGGGATCAGCTTCTCCTCAGGATTCTGAAGAGCCCGACAGCTACACGGAATGCACA 
GATGGCTATGAGTGGGACCCAGACAGCCAGCACTGCCGGGATGTCAACGAGTGTCTGACC 
ATCCCTGAGGCCTGCAAGGGGGAAATGAAGTGCATCAACCACTACGGGGGCTACTTGTGC 
CTGCCCCGCTCCGCTGCCGTCATCAACGACCTACATGGCGAGGGACCCCCGCCACCAGTG 
CCTCCCGCTCAACACCCCAACCCCTGCCCACCAGGCTATGAGCCCGACGATCAGGACAGC 
TGTGTGGATGTGGACGAGTGTGCCCAGGCCCTGCACGACTGTCGCCCCAGCCAGGACTGC 
CATAACTTGCCTGGCTCCTATCAGTGCACCTGCCCTGATGGTTACCGCAAGATCGGGCCC 
GAGTGTGTGGACATAGACGAGTGCCGCTACCGCTACTGCCAGCACCGCTGCGTGAACCTG 
CCTGGCTCCTTCCGCTGCCAGTGCGAGCCGGGCTTCCAGCTGGGGCCTAACAACCGCTCC 
TGTGTTGATGTGAACGAGTGTGACATGGGGGCCCCATGCGAGCAGCGCTGCTTCAACTCC 
TATGGGACCTTCCTGTGTCGCTGCCACCAGGGCTATGAGCTGCATCGGGATGGCTTCTCC 
TGCAGTGATATTGATGAGTGTAGCTACTCCAGCTACCTCTGTCAGTACCGCTGCGTCAAC 
GAGCCAGGCCGTTTCTCCTGCCACTGCCCACAGGGTTACCAGCTGCTGGCCACACGCCTC 
TGCCAAGACATTGATGAGTGTGAGTCTGGTGCGCACCAGTGCTCCGAGGCCCAAACCTGT 
GTCAACTTCCATGGGGGCTACCGCTGCGTGGACACCAACCGCTGCGTGGAGCCCTACATC 
CAGGTCTCTGAGAACCGCTGTCTCTGCCCGGCCTCCAACCCTCTATGTCGAGAGCAGCCT 
TCATCCATTGTGCACCGCTACATGACCATCACCTCGGAGCGGAGCGTGCCCGCTGACGTG 
TTCCAGATCCAGGCGACCTCCGTCTACCCCGGTGCCTACAATGCCTTTCAGATCCGTGCT 
GGAAACTCGCAGGGGGACTTTTACATTAGGCAAATCAACAACGTCAGCGCCATGCTGGTC 
CTCGCCCGGCCGGTGACGGGCCCCCGGGAGTACGTGCTGGACCTGGAGATGGTCACCATG 
AATTCCCTCATGAGCTACCGGGCCAGCTCTGTACTGAGGCTCACCGTCTTTGTAGGGGCC 
TACACCTTCTGAGGAGCAGGAGGGAGCCACCCTCCCTGCAGCTACCCTAGCTGAGGAGCC 
TGTTGTGAGGGGCAGAATGAGAAAGGCAATAAAGGGAGAAAGAAAGTCCTGGTGGCTGAG 
GTGGGCGGGTCACACTGCAGGAAGCCTCAGG 

CTGGGGCAGGGTGGCACTTGGGGGGGCAGGCCAAGTTCACCTAAATGGGGGTCTCTATAT 
GTTCAGGCCCAGGGGCCCCCATTGACAGGAGCTGGGAGCTCTGCACCACGAGCTTCAGTC 
ACCCCGAGAGGAGAGGAGGTAACGAGGAGGGCGGACTCCAGGCCCCGGCCCAGAGATTTG 
GACTTGGCTGGCTTGCAGGGGTCCTAAGAAACTCCACTCTGGACAGCGCCAGGAGGCCCT 
GGGTTCCATTCCTAACTCTGCCTCAAACTGTACATTTGGATAAGCCCTAGTAGTTCCCTG 
GGCCTGTTTTTCTATAAAACGAGGCAACTGGAAAAAAAAAAAA 
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Figure 10 

MLPCASCLPGSLLLWALLLLLLGSASPQDSEEPDSYTECTDGYEWDPDSQHCRDVNECLT 
IPEACKGEMKCINHYGGYLCLPRSAAVINDLHGEGPPPPVPPAQHPNPCPPGYEPDDQDS 
CVDVDECAQALHDCRPSQDCHNLPGSYQCTCPDGYRKIGPECVDIDECRYRYCQHRCVNL 
PGSFRCQCEPGFQLGPNNRSCVDVNECDMGAPCEQRCFNSYGTFLCRCHQGYELHRDGFS 
CSDIDECSYSSYLCQYRCVNEPGRFSCHCPQGYQLLATRLCQDIDECESGAHQCSEAQTC 
VNFHGGYRCVDTNRCVEPYI QVSENRCLCPASNPLCREQPS S I VHRYMTITSERSVPADV 
FQ I QATS VY PGAYNAFQ I RAGNSQGDFY I RQ I NNVS AML VLAR PVTGPRE YVLDLEMVTM ■ 
NSLMSYRASSVLRLTVFVGAYTF 
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Figure 11 

GGGAACGGAAA 

ATGGCGCCTCACGGCCCGGGTAGTCTTACGACCCTGGTGCCCTGGGCTGCCGCCCTGCTC 
CTCGCTCTGGGCGTGGAAAGGGCTCTGGCGCTACCCGAGATATGCACCCAATGTCCAGGG 
AGCGTGCAAAATTTGTCAAAAGTGGCCTTTTATTGTAAAACGACACGAGAGCTAATGCTG 
CATGCCCGTTGCTGCCTGAATCAGAAGGGCACCATCTTGGGGCTGGATCTCCAGAACTGT 
TCTCTGGAGGACCCTGGTCCAAACTTTCATCAGGCACATACCACTGTCATCATAGACCTG 
CAAGCAAACCCCCTCAAAGGTGACTTGGCCAACACCTTCCGTGGCTTTACTCAGCTCCAG 
ACTCTGATACTGCCACAACATGTCAACTGTCCTGGAGGAATTAATGCCTGGAATACTATC 
ACCTCTTATATAGACAACCAAATCTGTCAAGGGCAAAAGAACCTTTGCAATAACACTGGG 
GACC CAGAAATGTGTCCTGAGAATGGATCTTGTGTAC CTGATGGTCCAGGT CTTTTGCAG 
TGTGTTTGTGCTGATGGTTTCCATGGATACAAGTGTATGCGCCAGGGCTCGTTCTCACTG 
CTTATGTTCTTCGGGATTCTGGGAGCCACCACTCTATCCGTCTCCATTCTGGTTTGGGCG 
ACCCAGCGCCGAAAAGCCAAGACTTC ATGA ACTACATAGGTCTTACCATTGACCTAAGAT 
CAATCTGAACTATCTTAGCCCAGTCAGGGAGCTCTGCTTCCTAGAAAGGCATCTTTCGCC 
AGTGGATTCGCCTCAAGGTTGAGGCCGCCATTGGAAGATGAAAAATTGCACTCCCTTGGT 
GTAGACAAATACCAGTTCCCATTGGTGTTGTTGCCTATAATAAACACTTTTTCTTTTTTN 
AAAAAAAAAAAAAAAAAAAAA 
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Figure 12 

MAPHGPGSLTTLVPWAAALLLALGVERAIJ^LPEICTQCPGSVQNLSKVAFYCKTTRELML 
HARCCLNQKGTILGLDLQNCSLEDPGPNFHQAHTI^IlbLQANPLKGDLANTPRGFTQLQ 
TLILPQHVNCPGGINAWNTITSYIDNQICQGQKNLCNNTGDPEMCPENGSCVPDGPGLLQ 
CVCADGFHGYKCMRQGSFSLLMFFGILGATTLSVSILLWATQRRKAKTS 
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Figure 13 

CGAGGGCTTTTCCGGCTCCGGAATGGCACATGTGGGAATCCCAGTCTTGTTGGCTACAAC 
ATTTTTCCCTTTCCTAACAAGTTCTAACAGCTGTTCTAACAGCTAGTGATCAGGGGTTCT 
TCTTGCTGGAGAAGAAAGGGCTGAGGGCAGAGCAGGGCACTCTCACTCAGGGTGACCAGC 
TCCTTGCCTCTCTGTGGATAACAGAGCATGAGAAAGTGAAGAGATGCAGCGGAGTGAGGT 
GATGGAAGTCTAAAATAGGAAGGAATTTTGTGTGCAATATCAGACTCTGGGAGCAGTTGA 
CCTGGAGAGCCTGGGGGAGGGCCTGCCTAACAAGCTTTCAAAAAACAGGAGCGACTTCCA 
CTGGGCTGGGATAAGACGTGCCGGTAGGATAGGGAAGACTGGGTTTAGTCCTAATATCAA 
ATTGACTGGCTGGGTGAACTTCAACAGCCTTTTAACCTCTCTGGGAGATGAAAACGATGG 
CTTAAGGGGCCAGAAATAGAGATGCTTTGTAAAATAAAATTTTAAAAAAAGCAAGTATTT 
TATAGCATAAAGGCTAGAGACCAAAATAGATAACAGGATTCCCTGAACATTCCTAAGAGG 
GAGAAAGTATGTTAAAAATAGAAAAACCAAAATGCAGAAGGAGGAGACTCACAGAGCTAA 
ACCAGG 

ATGG GGACCCTGGGTCAGGCCAGCCTCTTTGCTCCTCCCGGAAATTATTTTTGGTCTGAC 

CACTCTGCCTTGTGTTTTGCAGAATCATGTGAGGGCCAACCGGGGAAGGTGGAGCAGATG 

AGCACACACAGGAGCCGTCTCCTCACCGCCGCCCCTCTCAGCATGGAACAGAGGCAGCCC 

TGGCCCCGGGCCCTGGAGGTGGACAGCCGCTCTGTGGTCCTGCTCTCAGTGGTCTGGGTG 

CTGCTGGCCCCCCCAGCAGCCGGCATGCCTCAGTTCAGCACCTTCCACTCTGAGAATCGT 

GACTGGACCTTCAACCACTTGACCGTCCACCAAGGGACGGGGGCCGTCTATGTGGGGGCC 

ATCAACCGGGTCTATAAGCTGACAGGCAACCTGACCATCCAGGTGGCTCATAAGACAGGG 

CCAGAAGAGGACAACAAGTCTCGTTACCCGCCCCTCATCGTGCAGCCCTGCAGCGAAGTG 

CTCACCCTCACCAACAATGTCAACAAGCTGCTCATCATTGACTACTCTGAGAACCGCCTG 

CTGGCCTGTGGGAGCCTCTACCAGGGGGTCTGCAAGCTGCTGCGGCTGGATGACCTCTTC 

ATCCTGGTGGAGCCATCCCACAAGAAGGAGCACTACCTGTCCAGTGTCAACAAGACGGGC 

ACCATGTACGGGGTGATTGTGCGCTCTGAGGGTGAGGATGGCAAGCTCTTCATCGGCACG 

GCTGTGGATGGGAAGCAGGATTACTTCCCGACCCTGTCCAGCCGGAAGCTGCCCCGAGAC 

CCTGAGTCCTCAGCCATGCTCGACTATGAGCTACACAGCGATTTTGTCTCCTCTCTCATC 

AAGATCCCTTCAGACACCCTGGCCCTGGTCTCCCACTTTGACATCTTCTACATCTACGGC 

TTTGCTAGTGGGGGCTTTGTCTACTTTCTCACTGTCCAGCCCGAGACCCCTGAGGGTGTG 

GCCATCAACTCCGCTGGAGACCTCTTCTACACCTCACGCATCGTGCGGCTCTGCAAGGAT 

GACCCCAAGTTCCACTCATACGTGTCCCTGCCCTTCGGCTGCACCCGGGCCGGGGTGGAA 

TACCGCCTCCTGCAGGCTGCTTACCTGGCCAAGCCTGGGGACTCACTGGCCCAGGCCTTC 

AATATCACCAGCCAGGACGATGTACTCTTTGCCATCTTCTCCAAAGGGCAGAAGCAGTAT 

CACCACCCGCCCGATGACTCTGCCCTGTGTGCCTTCCCTATCCGGGCCATCAACTTGCAG 

ATCAAGGAGCGCCTGCAGTCCTGCTACCAGGGCGAGGGCAACCTGGAGCTCAACTGGCTG 

CTGGGGAAGGACGTCCAGTGCACGAAGGCGCCTGTCCCCATCGATGATAACTTCTGTGGA 

CTGGACATCAACCAGCCCCTGGGAGGCTCAACTCCAGTGGAGGGCCTGACCCTGTACACC 

ACCAGCAGGGACCGCATGACCTCTGTGGCCTCCTACGTTTACAACGGCTACAGCGTGGTT 

TTTGTGGGGACTAAGAGTGGCAAGCTGAAAAAGGTAAGAGTCTATGAGTTCAGATGCTCC 

AATGCCATTCACCTCCTCAGCAAAGAGTCCCTCTTGGAAGGTAGCTATTGGTGGA 

GATTTAACTATAGGCAACTTTATTTTCTTGGGGAACAAAGG TGAA ATGGGGAGGTAAGAA 

GGGGTTAATTTTGTGACTTAGCTTCTAGCTACTTCCTCCAGCCATCAGTCATTGGGTATG 

TAAGGAATGCAAGCGTATTTCAATATTTCCOUVACTTTAAGAAAAAACTTTAAG 

CATCTGCAAAAGCAAA 
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Figure 14 

MGTLGQASLFAPPGNYFWSDHSALCFAESCEGQPGKVEQMSTHRSRLLTAAPLSMEQRQP 
WPRALEVDSRSVVLLSVVWVLIJVPPAAGMPQFSTFHSENRDWTFNHLTVHQGTGAVYVGA 
INRVYKLTGNLTIQVAHKTGPEEDNKSRYPPL I VQPCSEVLTLTNNVNKLLI IDYSENRL 
IACGSLYQGVCKLLRLDDLFILVEPSHKKEHYLSSVNKTGTMYGVIVRSEGEDGKLFIGT 
AVDGKQDYFPTLSSRKLPRDPESSAMLDYELHSDFVSSLIKIPSDTLALVSHFDIFYIYG 
FASGGFVYFLTVQPETPEGVAINSAGDLFYTSRIVRLCKDDPKFHSYVSLPFGCTRAGVE 
YRLLQAA YLAKPGDSLAQAFN I TSQDDVLF AI FS KGQKQYHHPPDDS ALCAFP I RAI NLQ 
IKERLQSCYQGEGNLELNWLLGKDVQCTKAPVPIDDNFCGLDINQPLGGSTPVEGLTLYT 
TSRDRMTSVASYVYNGYSWFVGTKSGKLKKVRVYEFRCSNAIHLLSKESLLEGSYWWRF 

NYRQLYFLGEQR 
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Figure 15 



CCCAGAAGTTCAAGGGCCCCCGGCCTCCTGCGCTCCTGCCGCCGGGACCCTCGACCTCCT 
CAGAGCAGCCGGCTGCCGCCCCGGGAAG 

ATGGCGAGGAGGAGCCGCCACCGCCTCCTCCTGCTGCTGCTGCGCTACCTGGTGGTCGCC 
CTGGGCTATCATAAGGCCTATGGGTTTTCTGCCCCAAAAGACCAACAAGTAGTCACAGCA 
GTAGAGTACCAAGAGGCTATTTTAG C CTGCAAAACCCCAAAGAAGACTGTTTCCTCCAGA 
TTAGAGTGGAAGAAACTGGGTCGGAGTGTCTCCTTTGTCTACTATCAACAGACTCTTCAA 
GGTGATTTTAAAAATCGAGCTGAGATGATAGATTTCAATATCCGGATCAAAAATGTGACA 
AGAAGTGATGCGGGGAAATATCGTTGTGAAGTTAGTGCCCCATCTGAGCAAGGCCAAAAC 
CTGGAAGAGGATACAGTCACTCTGGAAGTATTAGTGGCTCCAGCAGTTCCATCATGTGAA 
GTACCCTCTTCTGCTCTGAGTGGAACTGTGGTAGAGCTACGATGTCAAGACAAAGAAGGG 
AATCCAGCTCCTGAATACACATGGTTTAAGGATGGCATCCGTTTGCTAGAAAATCCCAGA 
CTTGGCTCCCAAAGCACCAACAGCTCATACACAATGAATACAAAAACTGGAACTCTGCAA 
TTTAATACTGTTTCCAAACTGGACACTGGAGAATATTCCTGTGAAGCCCGCAATTCTGTT 
GGATATCGCAGGTGTCCTGGGAAACGAATGCAAGTAGATGATCTCAACATAAGTGGCATC 
ATAGCAGCCGTAGTAGTTGTGGCCTTAGTGATTTCCGTTTGTGGCCTTGGTGTATGCTAT 
GCTCAGAGGAAAGGCTACTTTTCAAAAGAAACCTCCTTCCAG 

AAGAGTAATTCTTCATCTAAAGCCACGACAATGAGTGAAAATGTGCAGTGGCTCACGCCT 
GTAATCCCAGCACTTTGGAAGGCCGCGGCGGGCGGATCACGAGGTCAGGAGTTCTAGACC 
AGTCTGGCCAATATGGTGAAACCCCATCTCTACTAAAATACAAAAATTAGCTGGGCATGG 
TGGCATGTGCCTGCAGTTCCAGCTGCTTGGGAGACAGGAGAATCACTTGAACCCGGGAGG 
CGGAGGTTGCAGTGAGCTGAGATCACGC CACTGCAGTCCAGC CTGGGTAACAGAGCAAGA 
TTCCATCTCAAAAAATAAAATAAATAAATAAATAAATACTGGTTTTTACCTGTAGAATTC 
TTACAATAAATATAGCTTGATATTC 
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Figure 16 

MARRSRHRLLLLLLRYLVVALGYHKAYGFSAPKDQQVVTAVEYQEAILACKTPKKTVSSR 

LEWKKLGRSVSFVYYQQTLQGDFKNRAEMIDFNIRIKNVTRSDAGKYRCEVSAPSEQGQN 

LEEDTVTLEVLVAPAVPSCEVPSSALSGTVVELRCQDKEGNPAPEYTWFKDGIRLLENPR 

LGSQSTNSSYTMNTKTGTLQFm^SKLDTGEYSCEARNSVGYRRCPGKRMQVDDLNISGI 

IAAVVVVALVISVCGLGVCYAQRKGYFSKETSFQKSNSSSKATTMSENVQWLTPVIPALW 

KAAAGGSRGQEF 
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Figure 17 

TGGGGGCCCCCCAGGCTCGCGCGTGGAGCGAAGCAGCATGGGCAGTCGGTGCGCGCTGGC 
CCTGGCGGTGCTCTCGGCCTTGCTGTGTCAGGTCTGGAGCTCT 

GGGGTGTTCGAACTGAAGCTGCAGGAGTTCGTCAACAAGAAGGGGCTGCTGGGGAACCGC 
AATTGCTGCCGCGGGGGCGCGGGGCCACCGCCGTGCGCCTGCCGGACCTTCTTCCGCGTG 
TGCCTCAAGCACTACCAGGCCAGCGTGTCCCCCGAGCCGCCCTGCACCTACGGCAGCGCC 
GTCACCCCCGTGCTGGGCGTCGACTCCTTCAGTCTGCCCGACGGCGGGGGCGCCGACTCC 
GCGTTCAGCAACCCCATCCGCTTCCCCTTCGGCTTCACCTGGCCGGGCACCTTCTCTCTG 
ATTATTGAAGCTCTCCACACAGATTCTCCTGATGACCTCGCAACAGAAAACCCAGAAAGA 
CTCATCAGCCGCCTGGCCACCCAGAGGCACCTGACGGTGGGCGAGGAGTGGTCCCAGGAC 
CTGCACAGCAGCGGCCGCACGGACCTCAAGTACTCCTACCGCTTCGTGTGTGACGAACAC 
TACTACGGAGAGGGCTGCTCCGTTTTCTGCCGTCCCCGGGACGATGCCTTCGGCCACTTC 
ACCTGTGGGGAGCGTGGGGAGAAAGTGTGCAACCCTGGCTGGAAAGGGCCCTACTGCACA 
GAGCCGATCTGCCTGCCTGGATGTGATGAGCAGCATGGATTTTGTGACAAACCAGGGGAA 
TGCAAGTGCAGAGTGGGCTGGCAGGGCCGGTACTGTGACGAGTGTATCCGCTATCCAGGC 
TGTCTCCATGGCACCTGCCAGCAGCCCTGGCAGTGCAACTGCCAGGAAGGCTGGGGGGGC 
CTTTTCTGCAACCAGGACCTGAACTACTGCACACACCATAAGCCCTGCAAGAATGGAGCC 
ACCTGCACCAACACGGGCCAGGGGAGCTACACTTGCTCTTGCCGGCCTGGGTACACAGGT 
GCCACCTGCGAGCTGGGGATTGACGAGTGTGACCCCAGCCCTTGTAAGAACGGAGGGAGC 
TGCACGGATCTCGAGAACAGCTACTCCTGTACCTGCCCACCCGGCTTCTACGGCAAAATC 
TGTGAATTGAGTGCCATGACCTGTGCGGACGGCCCTTGCTTTAACGGGGGTCGGTGCTCA 
GACAGCCCCGATGGAGGGTACAGCTGCCGCTGCCCCGTGGGCTACTCCGGCTTCAACTGT 
GAGAAGAAAATTGACTACTGCAGCTCTTCACCCTGTTCTAATGGTGCCAAGTGTGTGGAC 
CTCGGTGATGCCTACCTGTGCCGCTGCCAGGCCGGCTTCTCGGGGAGGCACTGTGACGAC 
AACGTGGACGACTGCGCCTCCTCCCCGTGCGCCAACGGGGGCACCTGCCGGGATGGCGTG 
AACGACTTCTCCTGCACCTGCCCGCCTGGCTACACGGGCAGGAACTGCAGTGCCCCCGTC 
AGCAGGTGCGAGCACGCACCCTGCCACAATGGGGCCACCTGCCACGAGAGGGGCCACCGC 
TATGTGTGCGAGTGTGCCCGAGGCTACGGGGGTCCCAACTGCCAGTTCCTGCTCCCCGAG 
CTGCCCCCGGGCCCAGCGGTGGTGGACCTCACTGAGAAGCTAGAGGGCCAGGGCGGGCCA 
TTCCCCTGGGTGGCCGTGTGCGCCGGGGTCATCCTTGTCCTCATGCTGCTGCTGGGCTGT 
GCCGCTGTGGTGGTCTGCGTCCGGCTGAGGCTGCAGAAGCACCGGCCCCCAGCCGACCCC 
TGCCGGGGGGAGACGGAGACCATGAACAACCTGGCCAACTGCCAGCGTGAGAAGGACATC 
TCAGTCAGCATCATCGGGGCCACGCAGATCAAGAACACCAACAAGAAGGCGGACTTCCAC 
GGGGACCACAGCGCCGACAAGAATGGCTTCAAGGCCCGCTACCCAGCGGTGGACTATAAC 
CTCGTGCAGGACCTCAAGGGTGACGACACCGCCGTCAGGGACGCGCACAGCAAGCGTGAC 
ACCAAGTGCCAGCCCCAGGGCTCCTCAGGGGAGGAGAAGGGGACCCCGACCACACTCAGG 
GGTGGAGAAGCATCTGAAAGAAAAAGGCCGGACTCGGGCTGTTCAACTTCAAAAGACACC 
AAGTACCAGTCGGTGTACGTCATATCCGAGGAGAAGGATGAGTGCGTCATAGCAACTGAG 
GT GTAAA ATGGAAGTGAGATGGCAAGACTCCCGTTTCTCTTAAAATAAGTAAAATTCCAA 
GGATATATGCCCCAACGAATGCTG CTGAAGAGGAGGGAGG 

CCTCGTGGACTGCTGCTGAGAAACCGAGTTCAGACCGAGCAGGTTCTCCTCCTGAGGTCC 
TCGACGCCTGCCGACAGCCTGTCGCGGCCCGGCCGCCTGCGGCACTGCCTTCCGTGACGT 
CGCCGTTGCACTATGGACAGTTGCTCTTAAGAGAATATATATTTAAATGGGTGAACTGAA 
TTACGCATAAGAAGCATGCACTGCCTGAGTGTATATTTTGGATTCTTATGAGCCAGTCTT 
TTCTTGAATTAGAAACACAAACACTGCCTTTATTGTC^ 

TTTCTAGATGGAAAAGATGTGTGTTATTTTTTGGATTTGTAAAAATATTTTTCATGATAT 

CTGTAAAGCTTGAGTATTTTGTGATGTTCGTTTTTTATAATTTAAATTTTGGTAAATATG 

TACAAAGGCACTTCGGGTCTATGTGACTATATTTTTTTGTATATAAATGTATTTATGGAA 

TATTGTGCAAATGTTATTTGAGTTTTTTACTGTTTTGTTAATGAAGAAATTCCTTTTTAA 

AATATTTTTCCAAAATAAATTTTATGAATGACAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Figure 18 

MGSRCALALAVLSALLCQVWSSGVFELKLQEFVNKKGLLGNRNCCRGGAGPPPCACRTF 
FRVCLKHYQASVSPEPPCTYGSAVTPVLGVDSFSLPDGGGADSAFSNPIRFPFGFTWPG 
TFSLIIEALHTDSPDDLATENPERLISRLATQRHLTVGEEWSQDLHSSGRTDLKYSYRF 
VCDEH YYGEGCS VFCRPRDDAFGHFTCGERGEKVCNPGWKGP YCTE P I CL PGCDEQHGF 
CDKPGECKCRVGWQGR YCDEC I R Y PGCLHGTCQQPWQCNCQEGWGGLFCNQDLNYCTHH 
KPCKNGATCTNTGQGSYTCSCRPGYTGATCELGIDECDPSPCIQIGGSCTDLENSYSCTC 
PPGFYGKICELSAMTCADGPCFNGGRCSDSPDGGYSCRCPVGYSGFNCEKKIDYCSSSP 
CSNGAKCVDLGDAYLCRCQAGFSGRHCDDNVDDCASSPCANGGTCRDGVNDFSCTCPPG 
YTGRNCSAPVSRCEHAPCHNGATCHERGHRYVCECARGYGGPNCQFLLPELPPGPAWD 
LTEKLEGQGGPFPWVAVCAGVILVLMLLLGCAAVWCVRLRLQKHRPPADPCRGETETM 
NNLANCQRE KD I SVS I IGATQI KNTNKKADFHGDHS ADKNGFKARYPAVDYNLVQDLKG 
DDTAVRDAHSKRDTKCQPQGSSGEEKGTPTTLRGGEASERKRPDSGCSTSKDTKYQSVY 
VISEEKDECVIATEV 



18/133 



WO 00/53758 



PCT/USOO/05841 



Figure 19 

GCGGAGACAAGCGCAGAGCGCAGCGCACGGCCACAGACAGCCCTGGGCATCCACCGACGG 

CGCAGCCGGAGCCAGCAGAGCCGGAAGGCGCGCCCCGGGCAGAGAAAGCCGAGCAGAGCT 

GGGTGGCGTCTCCGGGCCGCCGCTCCGACGGGCCAGCGCCCTCCCC 

ATGTCCCTGCTCCCACGCCGCGCCCCTCCGGTCAGCATGAGGCTCCTGGCGGCCGCGCTG 

CTCCTGCTGCTGCTGGCGCTGTACACCGCGCGTGTGGACGGGTCCAAATGCAAGTGCTCC 

CGGAAGGGACCCAAGATCCGCTACAGCGACGTGAAGAAGCTGGAAATGAAGCCAAAGTAC 

CCGCACTGCGAGGAGAAGATGGTTATCATCACCACCAAGAGCGTGTCCAGGTACCGAGGT 

CAGGAGCACTGCCTGCAC C CCAAGCTG CAGAGCACCAAGCGCTTCATCAAGTGGTACAAC 

GCCTGGAACGAGAAGCGCAGGGTCTACGAAGAATAGGGTGAAAAACCTCAGAAGGGAAAA 

CTCCAAACCAGTTGGGAGACTTGTGCAAAGGACTTTGCAGATTAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAGCCTTTCTTTCTCACAGGCATAAGACACAAATT 

ATATATTGTTATGAAGCACTTTTTACCAACGGTCAGTTTTTACATTTTATAGCTGCGTGC 

GAAAGGCTTCCAGATGGGAGACCCATCTCTCTTGTGCTCCAGAC 

TTCATCACAGGCTGCTTTTTATCAAAAAGGGGAAAACTCATGCCTTTCCTTTTTAAAAAA 
TGCTTTTTTGTATTTGTCCATACGTCACTATACATCTGAGCTTTATAAGCGCCCGGGAGG 
AACAATGAGCTTGGTGGACACATTTCATTGCAGTGTTGCTCCATTCCTAGCTTGGGAAGC 
TTCCGCTTAGAGGTCCTGGCGCCTCGGCACAGCTGCCACGGGCTCTCCTGGGCTTATGGC 
CGGTCACAGCCTCAGTGTGACTCCACAGTGGCCCCTGTAGCCGGGCAAGCAGGAGCAGGT 
CTCTCTGCATCTGTTCTCTGAGGAACTCAAGTTTGGTTGCCAGAAAAATGTGCTTCATTC 
CCC CCTGGTTAATTTTTACACAC CCTAGGAAACATTTC CAAGATC CTGTGATGG CGAGAC 
AAATGATCCTTAAAGAAGGTGTGGGGTCTTTCCCAACCTGAGGATTTCTGAAAGGTTCAC 
AGGTTCAATATTTAATGCTTCAGAAGCATGTGAGGTTCCCAACACTGTCAGCAAAAACCT 
TAGGAGAAAACTTAAAAATATATGAATACATGCGCAATACACAGCTACAGACACACATTC 
TGTTGACAAGGGAAAACCTTCAAAGCATGTTTCTTTCCCTCACCACAACAGAACATGCAG 
TACTAAAGCAATATATTTGTGATTCCCCATGTAATTCTTCAATGTTAAACAGTGCAGTCC 
TCTTTCGAAAGCTAAGATGACCATGCGCCCTTTCCTCTGTACATATACCCTTAAGAACGC 
CCCCTCCACACACTGCCCCCCAGTATATGCCGCATTGTACTGCTGTGTTATATGCTATGT 
ACATGTCAGAAACCATTAGCATTGCATGCAGGTTTCATATTCTTTCTAAGATGGAAAGTA 
ATAAAATATATTTGAAATGTAAAAAAAAAAAAAAA 
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Figure 20 

MSLLPRRAPPVSMRLLAAALLLLLLALYTARVDGSKCKCSRKGPKIRYSDVKKLEMKPKY 
PHCEEKMVIITTKSVSRYRGQEHCLHPKLQSTKRFIKWYNAWNEKRRVYEE 
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Figure 21 

GGAGAGCGGAGCGAAGCTGGATAACAGGGGACCG 

ATGATGTGGCGACCATCAGTTCTGCTGCTTCTGTTGCTACTGAGGCACGGGGCCCAGGGG 
AAGCCATCCCCAGACGCAGGCCCTCATGGCCAGGGGAGGGTGCACCAGGCGGCCCCCCTG 
AGCGACGCTCCCCATGATGACGCCCACGGGAACTTCCAGTACGACCATGAGGCTTTCCTG 
GGACGGGAAGTGGCCAAGGAATTCGACCAACTCACCCCAGAGGAAAGCCAGGCCCGTCTG 
GGGCGGATCGTGGACCGCATGGACCGCGCGGGGGACGGCGACGGCTGGGTGTCGCTGGCC 
GAGCTTCGCGCGTGGATCGCGCACACGCAGCAGCGGCACATACGGGACTCGGTGAGCGCG 
GCCTGGGACACGTACGACACGGACCGCGACGGGCGTGTGGGTTGGGAGGAGCTGCGCAAC 
GCCACCTATGGCCACTACGCGCCCGGTGAAGAATTTCATGACGTGGAGGATGCAGAGACC 
TACAAAAAGATGCTGGCTCGGGACGAGCGGCGTTTCCGGGTGGCCGACCAGGATGGGGAC 
TCGATGGCCACTCGAGAGGAGCTGACAGCCTTCCTGCACCCCGAGGAGTTCCCTCACATG 
CGGGACATCGTGATTGCTGAAACCCTGGAGGACCTGGACAGAAACAAAGATGGCTATGTC 
CAGGTGGAGGAGTACATCGCGGATCTGTACTCAGCCGAGCCTGGGGAGGAGGAGCCGGCG 
TGGGTGCAGACGGAGAGGCAGCAGTTCCGGGACTTCCGGGATCTGAACAAGGATGGGCAC 
CTGGATGGGAGTGAGGTGGGCCACTGGGTGCTGCCCCCTGCCCAGGACCAGCCCCTGGTG 
GAAGCCAACCACCTGCTGCACGAGAGCGACACGGACAAGGATGGGCGGCTGAGCAAAGCG 
GAAATCCTGGGTAATTGGAACATGTTTGTGGGCAGTCAGGCCACCAACTATGGCGAGGAC 
CTGACCCGGCACCACGATGAGCTGTGAGCACCGCGCACCTGCCACAGCCTCAGAGGCCCG 
CACAATGACCGGAGGAGGGGCCGCTGTGGTCTGGCCCCCTCCCTGTCCAGGCCCCGCAGG 
AGGCAGATGCAGTCCCAGGCATCCTCCTGCCCCTGGGCTCTCAGGGACCCCCTGGGTCGG 
CTTCTGTCCCTGTCACACCCCCAACCCCAGGGAGGGGCTGTCATAGTCCCAGAGGATAAG 
CAATACCTATTTCTGACTGAGTCTCCCAGCCCAGACCCAGGGACCCTTGGCCCCAAGCTC 
AGCTCTAAGAACCGCCCCAACCCCTCCAGCTCCAAATCTGAGCTCTCCACCACATAGACTG 
AAACTCCCCTGGCCCCAGCCCTCTCCTGCCTGGCCTGGCCTGGGACACCTCCTCTCTGCC 
AGGAGGCAATAAAAGCCAGCGCCGGGACCTTGAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Figure 22 

MMWRPSVLLLLLLLRHGAQGKPSPDAGPHGQGRVHQAAPLSDAPHDDAHGNFQYDHEAFL 
GREVAKEFDQLTPEESQARLGRIVDRMDRAGDGDGWVSLAELRAWIAHTQQRHIRDSVSA 
AWDTYDTDRDGRVGWEELRNATYGHYAPGEEFHDVEDAETYKKMLARDERRFRVADQDGD 
SMATREELTAFLHPEEFPHMRDIVIAETLEDLDRNKDGYVQVEEYIADLYSAEPGEEEPA 
WVQTERQQFRDFRDLNKDGHLDGSEVGHWVLPPAQDQPLVEANHLLHESDTDKDGRLSKA 
E I LGNWNMFVGSQATNYGEDLTRHHDEL 
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Figure 23 

CAAAACITGCGTCGCGGAGAGCGCCCAGCTTGACTTGAATGGAAGGAGCCCGAGCCCGCG 
GAGCGCAGCTGAGACTGGGGGAGCGCGTTCGGCCTGTGGGGCGCCGCTCGGCGCCGGGGC 
GCAGCAGGGAAGGGGAAGCTGTGGTCTGCCCTGCTCCACGAGGCGCCACTGGTGTGAACC 
GGGAGAGCCCCTGGGTGGTCCCGTCCCCTATCCCTCCTTTATATAGAAACCTTCCACACT 
GGGAAGGCAGCGGCGAGGCAGGAGGGCTCATGGTGAGCAAGGAGGCCGGCTGATCTGCAG 
GCGCACAGCATTCCGAGTTTACAGATTTTTACAGATACCAA 

ATGGAAGGCGAGGAGGCAGAACAGCCTGCCTGGTTCCATCAGCCCTGGCGCCCAGGCGCA 
TCTGACTCGGCACCCCCTGCAGGCACCATGGCCCAGAGCCGGGTGCTGCTGCTCCTGCTG 
CTGCTGCCGCCACAGCTGCACCTGGGACCTGTGCTTGCCGTGAGGGCCCCAGGATTTGGC 
CGAAGTGGCGGCCACAGCCTGAGCCCCGAAGAGAACGAATTTGCGGAGGAGGAGCCGGTG 
CTGGTACTGAGCCCTGAGGAGCCCGGGCCTGGCCCAGCCGCGGTCAGCTGCCCCCGAGAC 
TGTGCCTGTTCCCAGGAGGGCGTCGTGGACTGTGGCGGTATTGACCTGCGTGAGTTCCCG 
GGGGACCTGCCTGAGCACACCAACCACCTATCTCTGCAGAACAACCAGCTGGAAAAGATC 
TACCCTGAGGAGCTCTCCCGGCTGCACCGGCTGG^GACACTGAACCTGCAAAACAACCGC 
CTGACTTCCCGAGGGCTCCCAGAGAAGGCGTTTGAGCATCTGACCAACCTCAATTACCTG 
TACTTGGCCAATAACAAGCTGACCTTGGCACCCCGCTTCCTGCCAAACGCCCTGATCAGT 
GTGGACTTTGCTGCCAACTATCTCACCAAGATCTATGGGCTCACCTTTGGCCAGAAGCCA 
AACTTGAGGTCTGTGTACCTGCACAACAACAAGCTGGCAGACGCCGGGCTGCCGGACAAC 
ATGTTCAACGGCTCCAGCAACGTCGAGGTCCTCATCCTGTCCAGCAACTTCCTGCGCCAC 
GTGCCCAAGCACCTGCCGCCTGCCCTGTACAAGCTGCACCTCAAGAACAACAAGCTGGAG 
AAGATCCCCCCGGGGGCCTTCAGCGAGCTGAGCAGCCTGCGCGAGCTATACCTGCAGAAC 
AACTACCTGACTGACGAGGGCCTGGACAACGAGACCTTCTGGAAGCTCTCCAGCCTGGAG 
TACCTGGATCTGTCCAGCAACAACCTGTCTCGGGTCCCAGCTGGGCTGCCGCGCAGCCTG 
GTGCTGCTGCACTTGGAGAAGAACGCCATCCGGAGCGTGGACGCGAATGTGCTGACCCCC 
ATCCGCAGCCTGGAGTACCTGCTGCTGCACAGCAACCAGCTGCGGGAGCAGGGCATCCAC 
CCACTGGCCTTCCAGGGCCTCAAGCGGTTGCACACGGTGCACCTGTACAACAACGCGCTG 
GAGCGCGTGCCCAGTGGCCTGCCTCGCCGCGTGCGCACCCTCATGATCCTGCACAACCAG 
ATCACAGGCATTGGCCGCGAAGACTTTGCCACCACCTACTTCCTGGAGGAGCTCAACCTC 
AGCTACAACCGCATCACCAGCCCACAGGTGCACCGCGACGCCTTCCGCAAGCTGCGCCTG 
CTGCGCTCGCTGGACCTGTCGGGCAACCGGCTGCACACGCTGCCACCTGGGCTGCCTCGA 
AATGTCCATGTGCTGAAGGTCAAGCGCAATGAGCTGGCTGCCTTGGCACGAGGGGCGCTG 
GCGGGCATGGCTCAGCTGCGTGAGCTGTACCTCACCAGCAACCGACTGCGCAGCCGAGCC 
CTGGGCCCCCGTGCCTGGGTGGACCTCGCCCATCTGCAGCTGCTGGACATCGCCGGGAAT 
CAGCTCACAGAGATCCCCGAGGGGCTCCCCGAGTCACTTGAGTACCTGTACCTGCAGAAC 
AACAAGATTAGTGCGGTGCCCGCCAATGCCTTCGACTCCACGCCCAACCTCAAGGGGATC 
TTTCTCAGGTTTAACAAGCTGGCTGTGGGCTCCGTGGTGGACAGTGCCTTCCGGAGGCTG 
AAGCACCTGCAGGTCTTGGACATTGAAGGCAACTTAGAGTTTGGTGACATTTCCAAGGAC 
CGTGGCCGCTTGGGGAAGGAAAAGGAGGAGGAGGAAGAGGAGGAGGAGGAGGAAGAGGAA 
ACAAGATAGTGAC AAGGTGATGCAGATG TGAC CTAGGATGATGGACCG CCGGACTCTTTT 
CTGCAGCACACGCCTGTGTGCTGTGAGCCCCCCACTCTGCCGTGCTCACACAGACACACC 
CAGCTGCACACATGAGGCATCCCACATGACACGGGCTGACACAGTCTCATATCCCCACCC 
CTTCCCACGGCGTGTCCCACGGCCAGACACATGCACACACATCACACCCTCAAACACCCA 
GCTCAGCCACACACAACTACCCTCCAAACCACCACAGTCTCTGTCACACCCCCACTACCG 
CTGCCACGCCCTCTGAATCATGCAGGGAAGGGTCTGCCCCTGCCCTGGCACACACAGGCA 
CCCATTCCCTCCCCCTGCTGACATGTGTATGCGTATGCATACACACCACACACACACACA 
TGCACAAGTCATGTGCGAACAGCCCTCCAAAGCCTATGCCACAGACAGCTCTTGCCCCAG 
CCAGAATCAGCCATAGCAGCTCGCCGTCTGCCCTGTCCATCTGTCCGTCCGTTCCCTGGA 
GAAGACACAAGGGTATCCATGCTCTGTGGCCAGGTGCCTGCCACCCTCTGGAACTCACAA 
AAGCTGGCTTTTATTCCTTTCCCATCCTATGGGGACAGGAGCCTTCZAGGACTGCTGGCCT 
GGCCTGGCCCACCCTGCTCCTCCAGGTGCTGGGCAGTCACTCTGCTAAGAGTCCCTCCCT 
GCCACGCCCTGGCAGGACACAGGCACTTTTCCAATGGGCAAGCCCAGTGGAGGCAGGATG 
GGAGAGCCCCCTGGGTGCTGCTGGGGCCTTGGGGCAGGAGTGAAGCAGAGGTGATGGGGC 
TGGGCTGAGCCAGGGAGGAAGGACCCAGCTGCACCTAGGAGACACCTTTGTTCTTCAGGC 
CTGTGGGGGAAGTTCCGGGTGCCTTTATTTTTTATTCTTTTCTAAGGAAAAAAATGATAA 
AAATCTCAAAGCTGATTTTTCTTGTTATAGAAAAACTAATATAAAAGCATTATCCCTATC 
CCTGCAAAAAAAAAA 
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Figure 24 

MEGEEMQPAWFHQPVmPGASDSAPPAGTMAQSRVLLLIiLLLPPQLHLGPVLAVRAPGFG 
RSGGHSLSPEENEFAEEEPVLVLSPEEPGPGPAAVSCPRDCACSQEGWDCGGIDLREFP 
GDLPEHTNHLSLQNNQLEKIYPEELSRIiHRLETLNLQNNRLTSRGLPEKAFEHLTNLNYL 
YLANNKLTLAPRFLPNALISVDFAANYLTKIYGLTFGQK^ 

MFNGSSNVEVLILSSNFLRHVPKHLPPALYKLHLKNNKLEKIPPGAFSELSSLRELYLQN 
NYLTDEGLDNETFWKLSSLEYLDLSSNNLSRVPAGLPRSLVLLHLEKNAIRSVDANVLTP 
IRSLEYLLLHSNQLREQGIHPLAFQGLKRLHTVHLYNNALERVPSGLPRRVRTLMILHNQ 
ITGIGREDFATTYFLEELNLSYNRITSPQVHRDAFRKLRLLRSLDLSGNRLHTLPPGLPR 
NVHVLKVKRNEIJWU^RGALAGMAQLRELYLTSNRLRSRALGPRAWVDLAHLQLLDIAG^ 
QLTEIPEGLPESLEYLYLQNNKI SAVPANAFDSTPNLKGIFLRFNKLAVGSWDSAFRRL 
KHLQVI^DIEGNLEFGDISKDRGRLGKEKEEEEEEEEEEEETR 
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Figure 25 

GGCGCCGGTGCACCGGGCGGGCTGAGCGCCTCCTGCGGCCCGGCCTGCGCGCCCCGGCCC 
GCCGCGCCGCCCACGCCCCAACCCCGGCCCGCGCCCCCTAGCCCCCGCCCGGGCCCGCGC 
CCGCGCCCGCGCCCAGGTGAGCGCTCCGCCCGCCGCGAGGCCCCGCCCCGGCCCGCCCCC 
GCCCCGCCCCGGCCGGCGGGGGAACCGGGCGGATTCCTCGCGCGTCAAACCACCTGATCC 
CATAAAACATTCATCCTCCCGGCGGCCCGCGCTGCGAGCGCCCCGCCAGTCCGCGCCGCC 
GCCGCCCTCGCCCTGTGCGCCCTGCGCGCCCTGCGCACCCGCGGCCCGAGCCCAGCCAGA 
GCCGGGCGGAGCGGAGCGCGCCGAGCCTCGTCCCGCGGCCGGGCCGGGGCCGGGCCGTAG 
CGGCGGCGCCTGGATGCGGACCCGGCCGCGGGGAGACGGGCGCCCGCCCCGAAACGACTT 
TCAGTCCCCGACGCGCCCCGCCCAACCCCTACG 

ATGAAGAGGGCGTCCGCTGGAGGGAGCCGGCTGCTGGCATGGGTGCTGTGGCTGCAGGCC 
TGGCAGGTGGCAGCCCCATGCCCAGGTGCCTGCGTATGCTACAATGAGCCCAAGGTGACG 
ACAAGCTGCCCCCAGCAGGGCCTGCAGGCTGTGCCCGTGGGCATCCCTGCTGCCAGCCAG 
CGCATCTTCCTGCACGGCAACCGCATCTCGCATGTGCCAGCTGCCAGCTTCCGTGCCTGC 
CGCAACGTCACCATCCTGTGGCTGCACTCGAATGTGCTGGCCCGAATTGATGCGGCTGCC 
TTCACTGGCCTGGCCCTCCTGGAGCAGCTGGACCTCAGCGATAATGCACAGCTCCGGTCT 
GTGGACCCTGCCACATTCCACGGCCTGGGCCGCCTACACACGCTGCACCTGGACCGCTGC 
GGCCTGCAGGAGCTGGGCCCGGGGCTGTTCCGCGGCCTGGCTGCCCTGCAGTACCTCTAC 
CTGCAGGACAACGCGCTGCAGGCACTGCCTGATGACACCTTCCGCGACCTGGGCAACCTC 
ACACACCTCTTCCTGCACGGCAACCGCATCTCCAGCGTGCCCGAGCGCGCCTTCCGTGGG 
CTGCACAGCCTCGACCGTCTCCTACTGCACCAGAACCGCGTGGCCCATGTGCACCCGCAT 
GCCTTCCGTGACCTTGGCCGCCTCATGACACTCTATCTGTTTGCCAACAATCTATCAGCG 
CTGCCCACTGAGGCCCTGGCCCCCCTGCGTGCCCTGCAGTACCTGAGGCTCAACGACAAC 
CCCTGGGTGTGTGACTGCCGGGCACGCCCACTCTGGGCCTGGCTGCAGAAGTTCCGCGGC 
TCCTCCTCCGAGGTGCCCTGCAGCCTCCCGCAACGCCTGGCTGGCCGTGACCTCAAACGC 
CTAGCTGCCAATGACCTGCAGGGCTGCGCTGTGGCCACCGGCCCTTACCATCCCATCTGG 
ACCGGCAGGGCC\CCGATGAGGAGCCGCTGGGGCTTCCCAAGTGCTGCCAGCCAGATGCC 
GCTGACAAGGCCTCAGTACTGGAGCCTGGAAGACCAGCTTCGGCAGGCAATGCGCTGAAG 
GGACGCGTGCCGCCCGGTGACAGCCCGCCGGGCAACGGCTCTGGCCCACGGCACATCAAT 
GACTCACCCTTTGGGACTCTGCCTGGCTCTGCTGAGCCCCCGCTCACTGCAGTGCGGCCC 
GAGGGCTCCGAGCCACCAGGGTTCCCCACCTCGGGCCCTCGCCGGAGGCCAGGCTGTTCA 
CGCAAGAACCGCACCCGCAGCCACTGCCGTCTGGGCCAGGCAGGCAGCGGGGGTGGCGGG 
ACTGGTGACTCAGAAGGCTCAGGTGCCCTACCCAGCCTCACCTGCAGCCTCACCCCCCTG 
GGCCTGGCGCTGGTGCTGTGGACAGTGCTTGGGCCCTGCTGACCCCCAGCGGACACAAGA 
GCGTGCTCAGCAGCCAGGTGTGTGTACATACGGGGTCTCTCTCCACGCCGCCAAGCCAGC 
CGGGCGGCCGACCCGTGGGGCAGGCCAGGCCAGGTCCTCCCTGATGGACGCCTGCCGCCC 
GCCACCCCCATCTCCACCCCATCATGTTTACAGGGTTCGGCGGCAGCGTTTGTTCCAGAA 
CGCCGCCTCCCACCCAGATCGCGGTATATAGAGATATGCATTTTATTTTACTTGTGTAAA 
AATATCGGACGACGTGGAATAAAGAGCTCTTTTCTTAAAAAAA 
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Figure 26 

MKRASAGGSRLLAWVLWLQAWQVAA.PCPGACVCYNEPKVTTSCPQQGLQAVPVG I PAAS 
QRIFLHGNRISHVPAASFRACRNLTILWLHSNVLARIDAAAFTGLALLEQLDLSDNAQL 
RS VD PAT FHGLGRLHTLHLDRCGLQELGPGL F RGLAALQ YL YLQDNALQAL PDDTFRDL 
GNLTHLFLHGNRISSVPERAFRGLHSLDRLLLHQNRVAHVHPBiAFRDLGRLMTLYLFAN 
NLSALPTEALAPLRALQYLRLNDNPWVCDCRARPLWAWLQKFRGSSSEVPCSLPQRLAG 
RDLKRI.AANDLQGCAVATGPYHPIWTGRATDEEPLGLPKCCQPDAADKASVLEPGRPAS 
AGNALKGRVPPGDSPPGNGSGPRHINDSPFGTLPGSAEPPLTAVRPEGSEPPGFPTSGP 
RRRPGCSRKNRTRSHCRLGQAGSGGGGTGDSEGSGALPSLTCSLTPLGLALVLWTVLGP 
C 
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Figure 27 

GCCCCAGGGACTGCTATGGCTTCCTTTGTTGTTCACCCCGGTCTGCGTC 

ATGT TAAACTCCAATGTCCTCCTGTGGTTAACTGCTCTTGCCATCAAGTTCACCCTCATT 

GACAGC CAAG CACAGTAT C CAGTTGTCAACACAAATTATGG CAAAAT C CGGGG C CTAAGA 

ACACCGTTACCCAATGAGATCTTGGGTCCAGTGGAGCAGTACTTAGGGGTCCCCTATGCC 

TCACCCCCCACTGGAGAGAGGCGGTTTCAGCCCCCAGAACCCCCGTCCTCCTGGACTGGC 

ATCCGAAATACTACTCAGTTTGCTGCTGTGTGCCCCCAGCACCTGGATGAGAGATCCTTA 

CTGC^TGACATGCTGCCCATCTGGTTTACCGCCAATTTGGATACTTTGATGACCTATGTT 

CAAGATCAAAATGAAGACTG C CTTTACTTAAACATCTACGTGCCCACGGAAGATGGAGCC 

AACACAAAGAAAAACGCAGATGATATAACGAGTAATGACCGTGGTGAAGACGAAGATATT 

CATGATCAGAACAGTAAGAAGCCCGTCATGGTCTATATCCATGGGGGATCTTACATGGAG 

GGCACCGG CAACATGATTGACGGCAGCATTTTGGCAAGCTACGGAAACGTCATCGTGATC 

ACCATTAACTACCGTCTGGGAATACTAGGGTTTTTAAGTACCGGTGACCAGGCAGCAAAA 

GGCAACTATGGGCTCCTGGATCAGATTCAAGCACTGCGGTGGATTGAGGAGAATGTGGGA 

GCCTTTGGCGGGGACCCCAAGAGAGTGACCATCTTTGGCTCGGGGGCTGGGGCCTCCTGT 

GTCAGCCTGTTGACCCTGTCCCACTACTCAGAAGGTCTCTTCCAGAAGGCCATCATTCAG 

AGCGGCACCGCCCTGTCCAGCTGGGCAGTGAACTACCAGCCGGCCAAGTACACTCGGATA , 

TTGGCAGACAAGGTCGGCTGCAACATGCTGGACACCACGGACATGGTAGAATGCCTGCGG 

AACAAGAACTACAAGGAGCTCATCCAGCAGACCATCACCCCGGCCACCTACCACATAGCC 

TTCGGGCCGGTGATCGACGGCGACGTCATCCCAGACGACCCCCAGATCCTGATGGAGCAA 

GGCGAGTTCCTCAACTACGACATCATGCTGGGCGTCAACCAAGGGGAAGGCCTGAAGTTC 

GTGGACGGCATCGTGGATAACGAGGACGGTGTGACGCCCAACGACTTTGACTTCTCCGTG 

TCCAACTTCGTGGACAACCTTTACGGCTACCCTGAAGGGAAAGACACTTTGCGGGAGACT 

ATCAAGTTCATGTACACAGACTGGGCCGATAAGGAAAACCCGGAGACGCGGCGGAAAACC 

CTGGTGGCTCTCTTTACTGACCACCAGTGGGTGGCCCCCGCCGTGGCCGCCGACCTGCAC 

GCGCAGTACGGCTCCCCCACCTACTTCTATGCCTTCTATCATCACTGCCAAAGCGAAATG 

AAGCCCAGCTGGGCAGATTCGGCCCA.TGGTGATGAGGTCCCCTATGTCTTCGGCATCCCC 

ATGATCGGTCCCACCGAGCTCTTCAGTTGTAACTTTTCCAAGAACGACGTCATGCTCAGC 

GCCGTGGTCATGACCTACTGGACGAACITCGCCAAAACTGGTGATCCAAATCAACCAGTT 

CCTCAGGATACCAAGTTCATTCACACAAAACCCAACCGCTTTGAAGAAGTGGCCTGGTCC 

AAGTATAATCCCAAAGACC^GCTCTATCTGCATATTGGCTTGAAACCCAGAGTGAGAGAT 

CACTACCGGGCAACGAAAGTGGCTTTCTGGTTGGAACTCGTTCCTCATTTGCACAACTTG 

AACGAGATATTCCAGTATGTTTCAACAACCACAAAGGTTCCTCCACCAGACATGACATCA 

TTTCCCTATGGCACCCGGCGATCTCCCGCCAAGATATGGCCAACCACCAAACGCCCAGCA 

ATCACTCCTGCCAACAATCCCAAACACTCTAAGGACCCTCACAAAACAGGGCCTGAGGAC 

ACAACTGTCCTCATTGAAACCAAACGAGATTATTCCACCGAATTAAGTGTCACCATTGCC 

GTCGGGGCGTCGCTCCTCTTCCTCAACATCTTAGCTTTTGCGGCGCTGTACTACAAAAAG 

GACAAGAGGCGCCATGAGACTCACAGGCGCCCCAGTCCCCAGAGAAACACCACAAATGAT 

ATCGCTCACATCCAGAACGAAGAGATCATGTCTCTGCAGATGAAGCAGCTGGAACACGAT 

CACGAGTGTGAGTCGCTGCAGGCACACGACACACTGAGGCTCACCTGCCCGCCAGACTAC 

ACCCTCACGCTGCGCCGGTCGCCAGATGACATCCCACTTATGACGCCAAACACCATCACC 

ATGATTCO^CACACTGACGGGGATGCAGCCTTTGCACACTTTTAACACCTTCAGTGGA 

GGACAAAACAGTACAAATTTACCCCACGGACATTCCACCACTAGAGT ATAGC TTTGCCCT 

ATTTCCCTTCCTATCCCTCTGCCCTACCCGCTCAGCAACATAGAAGAGGGAAGGAAAGAG 

AGAAGGAAAGAGAGAGAGAAAGAAAGTCTCCAGACCAGGAATGTTTTTGTCCCACTGACT 

TAAGACAAAAATGCAAAAAGGCAGTCATCCCATCCCGGCAGACCCTTATCGTTGGTGTTT 

TCCAGTATTACAAGATCAACTTCTGACCCTGTGAAATGTGAGAAGTACACATTTCTGTTA 

AAATAACTGCTTTAAGATCTCTACCACTCCAATCAATGTTTAGTGTGATAGGACATCACC 

ATTTCAAGGCCCCGGGTGTTTCCAACGTCATGGAAGCAGCTGACACTTCTGAAACTCAGC 

CAAGGACACTTGATATTTTTTAATTACAATGGAAGTTTAAACATTTCTTTCTGTGCCACA 

CAATGGATGGCTCTCCTTAAGTGAAGAAAGAGTCAATGAGATTTTGCCCAGCACAGGAGC 

TGTAATCCAGAGAGAAGGAAACGTAGAAATTTATTATTAAAAGAATGGACTGTGCAGCGA 

AATCTGTACGGTTCTGTGCAAAGAGGTGTTTTGCCAGCCTGAACTATATTTAAGAGACTT 

TGT 
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Figure 28 

MLNSNVLLWLTALAIKFTLIDSQAQYPVVNTNYGKI^ 

SPPTGERRFQPPEPPSSWTGIRNTTQFAAVCPQHLDERSLLHDMLPIWFTANLDTLMTYV 
QDQNEDC^YLNIYVPTEDGAm'KKNADD.ITSNDRGEDEDIHDQNSKKPVMVYIHGGSYME 
GTGNMIDGSILASYGNVIVITINYRLGILGFLSTGDQAAKGNYGLLDQIQALRWIEENVG 
AFGGDPKRVTIFGSGAGASCVSLLTLSHYSEGLFQKAIIQSGTALSSWAVNYQPAKYTRI 
LADKVGCNMLDTTDMVECLRNKNYKELIQQTITPATYHIAFGPVIDGDVIPDDPQILMEQ 
GEFL^DIMLGVNQGEGLKFVDGIVDNEDGVTPNDFDFSVSNFVDNLYGYPEGKDTLRET 
I KFMYTDWADKENPETRRKTLVALFTDHQWVAPAVAADLHAQYGS PTYFYAFYHHCQS EM 
KPSWADSAHGDEVPYVFGIPMIGPTELFSCNFSKNDVMLSAWMTYWTNFAKTGDPNQPV 
PQDTKFIHTKPNRFEEVAWSKYNPKDQLYLHIGLKPRVRDHYRATKVAFWLELVPHLHNL 
NEIFQYVSTTTKVPPPDMTSFPYGTRRSPAKIWPTTKRPAITPANNPKHSKDPHKTGPED 
TTVLI ETKRDYSTELSVTIAVGASLLFLNILAFAALYYKKDKRRHETHRRPS PQRNTTND 
IAHIQNEEIMSLQMKQLEHDHECESLQAHDTLRLTCPPDYTLTLRRSPDDIPLMTPNTIT 
MIPNTLTGMQPLHTFNTFSGGQNSTNLPHGHSTTRV 
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Figure 2 9 

GCGGAGCATCCGCTGCGGTCCTCGCCGAGACCCCCGCGCGGATTCGCCGGTCCTTCCCGC 
GGGCGCGACAGAGCTGTCCTCGCACCTGGATGGCAGCAGGGGCGCCGGGGTCCTCTCGAC 
GCCAGAGAGAAATCTCATCATCTGTGCAGCCTTCTTAAAGCAAACTAAGACCAGAGGGAG 
GATTATCCTTGACCTTTGAAGACCAAAACTAAACTGAAATTTAAA 

ATG TTCTTCGGGGGAGAAGGGAGCTTGACTTACACTTTGGTAATAATTTGCTTCCTGACA 

CTAAGGCTGTCTGCTAGTCAGAATTGCCTCAAAAAGAGTCTAGAAGATGTTGTCATTGAC 

ATCCAGTCATCTCTTTCTAAGGGAATCAGAGGCAATGAGCCCGTATATACTTCAACTCAA 

GAAGACTGCATTAATTCTTGCTGTTCAACAAAAAACATATCAGGGGACAAAGCATGTAAC 

TTGATGATCTTCGACACTCGAAAAACAGCTAGACAACCCAACTGCTACCTATTTTTCTGT 

CCCAACGAGGAAG C CTGTCCATTGAAACCAGCAAAAGGACTTATGAGTTACAGGATAATT 

ACAGATTTTCCATCTTTGACCAGAAATTTGCCAAGCCAAGAGTTACCCCAGGAAGATTCT 

CTCTTACATGGCCAATTTTCACAAGCAGTCACTCCCCTAGCCCATCATCACACAGATTAT 

TCAAAGCCCACCGATATCTCATGGAGAGACACACTTTCTCAGAAGTTTGGATCCTCAGAT 

CACCTGGAGAAACTATTTAAGATGGATGAAGCAAGTGCCCAGCTCCTTGCTTATAAGGAA 

AAAGGCCATTCTCAGAGTTCACAATTTTCCTCTGATCAAGAAATAGCTCATCTGCTGCCT 

GAAAATGTGAGTGCGCTCCCAGCTACGGTGGCAGTTGCTTCTCCACATACCACCTCGGCT 

ACTCCAAAGCCCGCCACCCTTCTACCCACOVATGCTTCAGTGACACCTTCTGGGACTTCC 

CAGCCACAGCTGGCCACCACAGCTCCACCTGTAACCACTGTCACTTCTCAGCCTCCCACG 

ACCCTCATTTCTACAGTTTTTACACGGGCTGCGGCTACACTCCAAGCAATGGCTACAACA 

GCAGTTCTGACTACCACCTTTCAGGCACCTACGGACTCGAAAGGCAGCTTAGAAACCATA 

CCGTTTACAGAAATCTCCAACTTAACTTTGAACACAGGGAATGTGTATAACCCTACTGCA 

CTTTCTATGTCAAATGTGGAGTCTTCCACTATGAATAAAACTGCTTCCTGGGAAGGTAGG 

GAGGCCAGTCCAGGCAGTTCCTCCCAGGGCAGTGTTCCAGAAAATCAGTACGGCCTTCCA 

TTTGAAAAATGGCTTCTTATCGGGTCCCTGCTCTTTGGTGTCCTGTTCCTGGTGATAGGC 

CTCGTCCTCCTGGGTAGAATCCTTTCGGAATCACTCCGCAGGAAACGTTA 

CTCAAGACTGGATTATTTGATCAATGGGATCTATGTGGACATC TAAG GATGGAACTCGGT 

GTCTCTTAATTCATTTAGTAACCAGAAGCCCAAATGCAATGAGTTTCTGCTGACTTGCTA 

GTCTTAGCAGGAGGTTGTATTTTGAAGACAGGAAAATGCCCCCTTCTGCTTTCCTTTTTT 

TTTTTGGAGACAGAGTCTTGCTCTGTTGCCCAGGCTGGAGTGCAGTAGCACGATCTCGGC 

TCTCACCGCAACCTCCGTCTCCTGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCTAAGTA 

TCTGGGATTACAGGCATGTGCCACCACACCTGGGTGATTTTTGTATTTTTAGTAGAGACG 

GGGTTTCACCATGTTGGTCAGGCTGGTCTCAAACTCCTGACCTAGTGATCCACCCTCCTC 

GGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACCACAGCTGGCCCCCTTCTGTTTT 

ATGTTTGGTTTTTGAGAAGGAATGAAGTGGGAACCAAATTAGGTAATTTTGGGTAATCTG 

TCTCTAAAATATTAGCTAAAAACAAAGCTCTATGTAAAGTAATAAAGTATAATTGCCATA 

TAAATTTCAAAATTCAACTGGCTTTTATGCAAAGAAACAGGTTAGGACATCTAGGT^ 

ATTCATTCACATTCITGGTTCCAGATAAAATCAACTGTTTATATCAATTTCTAATGG 

TGCTTTTCTTTTTATATGGATTCCTTTAAAACTTATTCCAGATGTAGTTCCTTCCAATTA 

AATATTTGAATAAATCTTTTGTTACTCAA 
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ATTTTCTGTGAAGGAACCAACTGATCTCCCCCACCCTTGGATTAGAGTTCCTGCTCTACC 
TTACCCACAGATAACACATGTTGTTTCTACTTGTAAATGTAAAGTCTTTAAAATAAACTA 
TTACAGATAAAAAA 
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Figure 30 

MFFGGEGSLTYTLVIICFLTLRLSASQNCLKKSLEDWIDIQSSLSKGIRGNEPVYTSTQ 
EDCINSCCSTKNI SGDKACNLMI FDTRKTARQPNCYLFFCPNEEACPLKPAKGLMSYRI I 
TDFPSLTRNLPSQELPQEDSLLHGQFSQAVTPLAHHHTDYSKPTDISWRDTLSQKFGSSD 
HLEKLFKMDEASAQLLAYKEKGHSQSSQFSSDQEIAHLLPENVSALPATVAVASPHTTSA 
TPKPATLLPTNASVTPSGTSQPQLATTAPPVTTVTSQPPTTLISTVFTRAAATLQAMATT 
AVLTTTFQAPTDS KGS LET IPFTEI SNLTLNTGNVYNPTALSMSNVE S S TMNKTASWEGR 
EASPGSSSQGSVPENQYGLPFEKWLLIGSLLFGVLFLVIGLVLLGRILSESLRRKRYSRL 

DYLINGIYVDI 
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Figure 3 1 

CCCACGCGTCCGCCCACGCGTCCGCCCACGGGTCCGCCCACGCGTCCGGGCCACCAGAAG 
TTTGAGCCTCTTTGGTAGCAGGAGGCTGGAAGAAAGGACAGAAGTAGCTCTGGCTGTG 
ATGGGGATCTTACTGGGCCTGCTACTCCTGGGGCACCTAACAGTGGACACTTATGGCCGT 
CCCATCCTGGAAGTGCCAGAGAGTGTAACAGGACCTTGGAAAGGGGATGTGAATCTTCCC 
TGCACCTATGACCCCCTGCAAGGCTACACCCAAGTCTTGGTGAAGTGGCTGGTACAACGT 
GGCTCAGACCCTGTCACCATCTTTCTACGTGACTCTTCTGGAGACCATATCCAGCAGGCA 
AAGTACCAGGGCCGCCTGCATGTGAGCCACAAGGTTCCAGGAGATGTATCCCTCCAATTG 
AGCACCCTGGAGATGGATGACCGGAGCCACTACACGTGTGAAGTCACCTGGCAGACTCCT 
GATGGCAACCAAGTCGTGAGAGATAAGATTACTGAGCTCCGTGTCCAGAAACTCTCTGTC 
TCCAAGCCCACAGTGACAACTGGCAGCGGTTATGGCTTCACGGTGCCCCAGGGAATGAGG 
ATTAGCCTTCAATGCCAGGCTCGGGGTTCTCCTCCCATCAGTTATATTTGGTATAAGCAA 
CAGACTAATAACCAGGAACCCATCAAAGTAGCAACCCTAAGTACCTTACTCTTCAAGCCT 
GCGGTGATAGCCGACTCAGGCTCCTATTTCTGCACTGCCAAGGGCCAGGTTGGCTCTGAG 
CAGCACAGCGACATTGTGAAGTTTGTGGTCAAAGACTCCTCAAAGCTACTCAAGACCAAG 
ACTGAGGCACCTACAACCATGACATACCCCTTGAAAGCAACATCTACAGTGAAGCAGTCC 
TGGGACTGGACCACTGACATGGATGGCTACCTTGGAGAGACCAGTGCTGGGCCAGGAAAG 
AGCCTGCCTGTCTTTGCCATCATCCTCATCATCTCCTTGTGCTGTATGGTGGTTTTTACC 
ATGGCCTATATCATGCTCTGTCGGAAGACATCCCAACAAGAGCATGTCTACGAAGCAGCC 
AGGTAAGAAAGTCTCTCCTCTTCCATTTTTGACCCCGTCCCTGCCCTCAATTTTGATTAC 
TGGCAGGAAATGTGGAGGAAGGGGGGTGTGGCACAGACCCAATCCTAAGGCCGGAGGCCT 
TCAGGGTCAGGACATAGCTGCCTTCCCTCTCTCAGGCACCTTCTGAGGTTGTTTTGGCCC 
TCTGAACACAAAGGATAATTTAGATCCATCTGCCTTCTGCTTCCAGAATCCCTGGGTGGT 
AGGATCCTGATAATTAATTGGCAAGAATTGAGGCAGAAGGGTGGGAAACCAGGACCACAG 
CCCCAAGTCCCTTCTTATGGGTGGTGGGCTCTTGGGCCATAGGGCACATGCCAGAGAGGC 
CAACGACTCTGGAGAAACCATGAGGGTGGCCATCTTCGCAAGTGGCTGCTCCAGTGATGA 
GCCAACTTCCCAGAATCTGGGCAACAACTACTCTGATGAGCCCTGCATAGGACAGGAGTA 
CCAGATCATCGCCCAGATCAATGGCAACTACGCCCGCCTGCTGGACACAGTTCCTCTGGA 
TTATGAGTTTCTGGCCACTGAGGGCAAAAGTGTCTGTTAAAAATGCCCCATTAGGCCAGG 
ATCTGCTGACATAATTGCCTAGTCAGTCCTTGCCTTCTGCATGGCCTTCTTCCCTGCTAC 
CTCTCTTCCTGGATAGCCCAAAGTGTCCGCCTACCAACACTGGAGCCGCTGGGAGTCACT 
' GGCTTTGCCCTGGAATTTGCCAGATGCATCTCAAGTAAGCCAGCTGCTGGATTTGGCTCT 
GGGCCCTTCTAGTATCTCTGCCGGGGGCTTCTGGTACTCCTCTCTAAATACCAGAGGGAA 
GATGCCCATAGCACTAGGACTTGGTCATCATGCCTACAGACACTATTCAACTTTGGCATC 
TTGCCACCAGAAGACCCGAGGGAGGCTCAGCTCTGCCAGCTCAGAGGACCAGCTATATCC 
AGGATCATTTCTCTTTCTTCAGGGCCAGACAGCTTTTAATTGAAATTGTTATTTCACAGG 
CCAGGGTTCAGTTCTGCTCCTCCACTATAAGTCTAATGTTCTGACTCTCTCCTGGTGCTC 
AATAAATATCTAATCATAACAGC 
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Figure 32 

MGILLGLLIiLGHLTVDTYGRPILEVPESVTGPWKGDWLPCTYDPLQGYTQVLVKWLVQR 
GSDPVTIFLRDSSGDHIQQAKYQGRLHVSHKVPGDVSLQLSTLEMDDRSHYTCEVTWQTP 
DGNQVVRDKITELRVQKLSVSKPTVTTGSGYGFTVPQGMRISLQCQARGSPPISYIWYKQ 
QTNNQEPIKVATLSTLLFKPAVIADSGSYFCTAKGQVGSEQHSDIVKFVVKDSSKLLJCTK 
TEAPTTMTYPLKATSTVKQSWDWTTDMDGYLGETSAGPGKSLPVFAI ILI ISLCCMWFT 
MAYIMLCRKTSQQEHVYEAAR 
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Figure 33 

GCGCCGGGAGCCCATCTGCCCCCAGGGGCACGGGGCGCGGGGCCGGCTCCCGCCCGGCAC 
ATGGCTGCAGCCACCTCGCGCGCACCCCGAGGCGCCGCGCCCAGCTCGCCCGAGGTCCGT 
CGGAGGCGCCCGGCCGCCCCGGAGCCAAGCAGCAACTGAGCGGGGAAGCGCCCGCGTCCG 
GGGATCGGG 

ATG TCCCTCCTCCTTCTCCTCTTGCTAGTTTCCTACTATGTTGGAACCTTGGGGACTCAC 
ACTGAGAT CAAGAGAGTGGCAGAGGAAAAGGTCACTTTGCCCTGCCAC CATCAACTGGGG 
CTTCCAGAAAAAGACACTCTGGATATTGAATGGCTGCTCACCGATAATGAAGGGAACCAA 
AAAGTGGTGATCACTTACTCCAGTCGTCATGTCTACAATAACTTGACTGAGGAACAGAAG 
GGCCGAGTGGCCTTTGCTTCCAATTTCCTGGCAGGAGATGCCTCCTTGCAGATTGAACCT 
CTGAAGCCCAGTGATGAGGGCCGGTACACCTGTAAGGTTAAGAATTCAGGGCGCTACGTG 
TGGAGCCATGTCATCTTAAAAGTCTTAGTGAGACCATCCAAGCCCAAGTGTGAGTTGGAA 
GGAGAGCTGACAGAAGGAAGTGACCTGACTTTGCAGTGTGAGTCATCCTCTGGCACAGAG 
CCCATTGTGTATTACTGGCAGCGAATCCGAGAGAAAGAGGGAGAGGATGAACGTCTGCCT 
CCCAAATCTAGGATTGACTACAACCACCCTGGACGAGTTCTGCTGCAGAATCTTACCATG 
TCCTACTCTGGACTGTACCAGTGCACAGCAGGCAACGAAGCTGGGAAGGAAAGCTGTGTG ' 
GTGCGAGTAACTGTACAGTATGTACAAAGCATCGGCATGGTTGCAGGAGCAGTGACAGGC 
ATAGTGGCTGGAGCCCTGCTGATTTTCCTCTTGGTGTGGCTGCTAATCCGAAGGAAAGAC 
AAAGAAAGATATGAGGAAGAAGAGAGACCTAATGAAATTCGAGAAGATGCTGAAGCTCCA 
AAAGCCCGTCTTGTGAAACCCAGCTCCTCTTCCTCAGGCTCTCGGAGCTCACGCTCTGGT 
TCTTCCTCCACTCGCTCCACAGCAAATAGTGCCTCACGCAGCCAGCGGACACTGTCAACT 
GACGCAGCACCCCAGCCAGGGCTGGCCACCCAGGCATACAGCCTAGTGGGGCCAGAGGTG 
AGAGGTTCTGAACCAAAGAAAGTCCACCATGCTAATCTGACCAAAGCAGAAACCACACCC 
AGCATGATCCCCAGCCAGAGCAGAGCCTTCCAAACGGTC TGAA TTACAATGGACTTGACT 
CCCACGCTTTCCTAGGAGTCAGGGTCTTTGGACTCTTCTCGTCATTGGAGCTCAAGTCAC 
CAGCCACACAACCAGATGAGAGGTCATCTAAGTAGCAGTGAGCATTGCACGGAACAGATT 
CAGATGAGGATTTT C CTTATACAATAC CAiU^C^-AGCAAAAGG ATGTAAGCTG ATT CATC T 
GTAAAAAGGCATCTTATTGTGCCTTTAGACCAGAGTAAGGGAAAGCAGGAGTCCAAATCT 
ATTTGTTGACCAGGACCTGTGGTGAGAAGGTTGGGGAAAGGTGAGGTGAATATACCTAAA 
* ACTTTTAATGTGGGATATTTTGTATCAGTGCTTTGATTCACAATTTTCAAGAGGAAATGG 
GATGCTGTTTGTAAATTTTCTATGCATTTCTGCAAACTTATTGGATTATTAGTTATTCAG 
ACAGTCAAGCAGAACCCACAGCCTTATTACACCTGTCTACACCATGTACTGAGCTAACCA 
CTTCTAAGAAACTCCAAAAAAGGAAACATGTGTCTTCTATTCTGACTTAACTTCATTTGT 
CATAAGGTTTGGATATTAATTTCAAGGGGAGTTGAAATAGTGGGAGATGGAGAAGAGTGA 
ATGAGTTTCTCCCACTCTATACTAATCTC^CTATTTGTATTGAGCCCAAAATAACTATGA 
AAGGAGACAAAAATTTGTGACAAAGGATTGTGAAGAGCTTTCCATCTTCATGATGTTATG 
AGGATTGTTGACAAACATTAGAAATATATAATGGAGCAATTGTGGATTTCCCCTCAAATC 
AGATGCCTCTAAGGACTTTCCTGCTAGATATTTCTGGAAGGAGAAAATACAACATGTCAT 
TTATCAACGTCCTTAGAAAGAATTCTTCTAGAGAAAAAGGGATCTAGGAATGCTGAAAGA 
TTACCCAACATACCATTATAGTCTCTTCTTTCTGAGAAAATGTGAAACCAGAATTGCAAG 
ACTGGGTGGACTAGAAAGGGAGATTAGATCAGTTTTCTCTTAATATGTCAAGGAAGGTAG 
CCGGGCATGGTGCCAGGCACCTGTAGGAAAATCCAGCAGGTGGAGGTTGCAGTGAGCCGA 
GATTATGCCATTGCACTCCAGCCTGGGTGACAGAGCGGGACTCCGTCTC 
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Figure 34 

MSLLLLLLLVSYYVGTLGTHTEIKRVAEEKVTLPCHHQLGLPEKDTLDIEWLLTDNEGNQ 

KVVrrYSSRHVYNNLTEEQKGRVAFASNFLAGDASLQIEPLKPSDEGRYTCKVKNSGRYV 

WSHVILKVLVRPSKPKCELEGELTEGSDLTLQCESSSGTEPIVYYWQRIREKEGEDERLP 

PKSRIDYNHPGRVLLQNLTMSYSGLYQCTAGNEAGKESCVVRVTVQYVQSIGMVAGAVTG 

IVAGALLIFLLVWLLIRRKDKERYEEEERPNEIREDAEAPKARLVKPSSSSSGSRSSRSG 

SSSTRSTANSASRSQRTLSTDAAPQPGLATQAYSLVGPEVRGSEPKKVHHANLTKAETTP 

SM1PSQSRAFQTV 



35 / 133 



WO 00/53758 



PCT/US00/05841 



Figure 35 

CACGCACTTCACCTGGGTCGGGATTCTCAGGTCATGAACGGTCCCAGCCACCTCCGGGCA 
GGGCGGGTGAGGACGGGGACGGGGCGTGTCCAACTGGCTGTGGGCTCTTGAAACCCGAGC 
ATGGCACAGCACGGGGCGATGGGCGCGTTTCGGGCCCTGTGCGGCCTGGCGCTGCTGTGC 
GCGCTCAGCCTGGGTCAGCGCCCCACCGGGGGTCCCGGGTGCGGCCCTGGGCGCCTCCTG 
CTTGGGACGGGAACGGACGCGCGCTGCTGCCGGGTTCACACGACGCGCTGCTGCCGCGAT 
TACCCGGGCGAGGAGTGCTGTTCCGAGTGGGACTGCATGTGTGTCCAGCCTGAATTCCAC 
TGCGGAGACCCTTGCTGCACGACCTGCCGGCACCACCCTTGTCCCCCAGGCCAGGGGGTA 
CAGTCCCAGGGGAAATTCAGTTTTGGCTTCCAGTGTATCGACTGTGCCTCGGGGACCTTC 
TCCGGGGGCC^CGAAGGCCACTGCAAACCTTGGAC^GACTGCACCCAGTTCGGGTTTCTC 
ACTGTGTTCCCTGGGAACAAGACCCACAACGCTGTGTGCGTCCCAGGGTCCCCGCCGGCA 
GAGCCGCTTGGGTGGCTGACCGTCGTCCTCCTGGCCGTGGCCGCCTGCGTCCTCCTCCTG 
ACCTCGGCCCAGCTTGGACTGCACATCTGGCAGCTGAGGAGTCAGTGCATGTGGCCCCGA 
GAGACCCAGCTGCTGCTGGAGGTGCCGCCGTCGACCGAAGACGCCAGAAGCTGCCAGTTC 
CCCGAGGAAGAGCGGGGCGAGCGATCGGCAGAGGAGAAGGGGCGGCTGGGAGACCTGTGG 
GTGTGAGCCTGGCCGTCCTCCGGGGCCACCGACCGCAGCCAGCCCCTCCCCAGGAGCTCC 
CCAGGCCGGAGGGGCTCTGCGTTCTGCTCTGGGCCGGGCCCTGCTCCCCTGGCAGCAGAA 
GTGGGTGCAGGAAGGTGGCAGTGACCAGCGCCCTGGACCATGCAGTTC 
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Figure 36 

MAQHGAMGAFRALCGIiALLCALSLGQRPTGGPGCGPGRLLLGTGTDARCCRVHTTRCCRD 
YPGEECCSEWDCMCVQPEFHCGDPCCTTCRHHPCPPGQGVQSQGKFSFGFQCIDCASGTF 
SGGHEGHCKPWTDCTQFGFLTVFPGNKTHNAVCVPGSPPAEPLGVnLTWLLAVAACVLLL 
TSAQLGLHIWQLRSQCMWPRETQLLLEVPPSTEDARSCQFPEEERGERSAEEKGRLGDLW 
V 



37 / 133 



WO 00/53758 



PCT/US0O/05841 



Figure 37 

GAAAGCTATAGGCTACCCATTCAGCTCCCCTGTCAGAGACTCAAGCTTTGAGAAAGGCTA 
GCAAAGAGCAAGGAAAGAGAGAAAACAACAAAGTGGCGAGGCCCTCAGAGTGAAAGCGTA 
AGGTTCAGTCAGCCTGCTGCAGCTTTGCAGACCTCAGCTGGGCATCTCCAGACTCCCCTG 
AAGG AAGAGC CTTC CTCAC C CAAAC C CACAAAAG 

ATGCTGAAAAAGCCTCTCTCAGCTGTGACCTGGCTCTGCATTTTCATCGTGGCCTTTGTC 

AGCCACCCAGCGTGGCTGCAGAAGCTCTCTAAGCACAAGACACCAGCACAGCCACAGCTC 

AAAGCGGCCAACTGCTGTGAGGAGGTGAAGGAGCTCAAGGCCCAAGTTGCCAACCTTAGC 

AGCCTGCTGAGTGAACTGAACAAGAAGCAGGAGAGGGACTGGGTCAGCGTGGTCATGCAG 

GTGATGGAGCTGGAGAGCAACAGCAAGCGCATGGAGTCGCGGCTCACAGATGCTGAGAGC 

AAGTACTCCGAGATGAACAACCAAATTGACATCATGCAGCTGCAGGCAGCACAGACGGTC 

ACTCAGACCTCCGCAGATGCCATCTACGACTGCTCTTCCCTCTACCAGAAGAACTACCGC 

ATCTCTGGAGTGTATAAGCTTCCTCCTGATGACTTCCTGGGCAGCCCTGAACTGGAGGTG 

TTCTGTGACATGGAGACTTCAGGCGGAGGCTGGACCATCATCCAGAGACGAAAAAGTGGC 

CTTGTCTCCTTCTACCGGGACTGGAAGCAGTACAAGCAGGGCTTTGGCAGCATCCGTGGG 

GACTTCTGGCTGGGGAACGAACACATCCACCGGCTCTCCAGACAGCCAACCCGGCTGCGT 

GTAGAGATGGAGGACTGGGAGGGCAACCTGCGCTACGCTGAGTATAGCCACTTTGTTTTG 

GGCAATGAACTCAACAGCTATCGCCTCTTCCTGGGGAACTACACTGGCAATGTGGGGAAC 

ACGCCCTCCAGTATCATAACAACACAGC CTTCAGCAC CAAGGACAAGGACAATGACAACT 

GCTTGGACAAGTGTGCACAGCTCCGCAAAGGTGGCTACTGGTACAACTGCTGCACAGACT 

CCAACCTCAATGGAGTGTACTACCGCCTGGGTGAGCACAATAAGCACCTGGATGGCATCA 

CCTGGTATGGCTGGCATGGATCTACCTACTCCCTCAAACGGGTGGAGATGAAAATCCGCC 

CAGAAGACTTCAAGCCT TAAA AGGAGGCTGCCGTGGAGCACGGATACAGAAACTGAGACA 

CGTGGAGACTGGATGAGGGCAGATGAGGACAGGAAGAGAGTGTTAGAA 

AGGGTAGGACTGAGAAACAGCCTATAATCTjCCAAAGAAAGAATAAGTCTCCAAGGAGCAC 

AAAAAAATCATATGTACCAAGGATGTTACAGTAAACAGGATGAACTATTTAAACCCACTG 

GGTCCTGCCACATCCTTCTCAAGGTGGTAGACTGAGTGGGGTCTCTCTGCCCAAGATCCC 

TGACATAGCAGTAGCTTGTCTTTTCCACATGATTTGTCTGTGAAAGAAAATAATTTTGAG 

ATCGTTTTATCTATTTTCTCTACGGCTTAGGCTATGTGAGGGCAAAACACAAATCCCTTT 

GCTAAAAAGAACCATATTATTTTGATTCTCAAAGGATAGGCCTTTGAGTGTTAGAGAAAG 

GAGTGAAGGAGGCAGGTGGGAAATGGTATTTCTATTTTTAAATCCAGTGAAATTATCTTG 

AGTCTACACATTATTTTTAAAACACAAAAATTGTTCGGCTGGAACTGACCCAGGCT 

TTGCGGGGAGGAAACTCCAGGGCACTGCATCTGGCGATCAGACTCTGAGCACTGCCCCTG 

CTCGCCTTGGTCATGTACAGCACTGAAAGGAATGAAGCACCAGCAGGAGGTGGACAGAGT 

CTCTCATGGATGCCGGCACAAAACTGCCTTAAAATATTCATAGTTAATACAGGTATATCT 

ATTTTTATTTACTTTGTAAGAAACAAGCTCAAGGAGCTTCCTTTTAAATTTTGTCTC 

GAAATGGTTGAAAACTGAAGGTAGATGGTGTTATAGTTAATAATAAATGCTGTAAATAAG 

CATCTCACTTTGTAAAAATAAAATATTGTGGTTTTGTTTTAAACATTCAACGTTT 

CCTTCTACAATAAACACTTTCAAAATGTG 
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Figure 38 



MLKKPLSAVTWLCIFIVAFVSHPAWLQKLSKHKTPAQPQLKAANCCEEVKELKAQVANLS 
SLLSELNKKQERDWVSVVMQVMELESNSKRMESRLTDAESKYSEMNNQIDIMQLQAAQTV 
TQTSADAIYDCSSLYQKNYRISGVYKLPPDDFLGSPELEVFCDMETSGGGWTIXQRRKSG 
LVSFYRDWKQYKQGFGSIRGDFWLGNEHIHRLSRQPTRLRVEMEDWEGNLRYAEYSHFVL 
GNELNSTOLFLG1TYTGNVGNDALQYHNNTAFSTKDKDNDNCLDKCAQLRKGGYWYNCCTD 
SNLNGVYYRLGEHNKHLDGITWYGWHGSTYSLKRVEMKIRPEDFKP 
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Figure 3 9 

GGAAGTCCACGGGGAGCTTGGATGCCAAAGGGAGGACGGCTGGGTCCTCTGGAGAGGACT 

ACTCACTGGCATATTTCTGAGGTATCTGTAGAATAACCACAGCCTCAGATACTGGGGACT 

TTACAGTCC CACAGAACCGTCCTCCCAGGAAG CTGAATC CAGCAAGAACA 

ATG GAGGCCAGCGGGAAGCTCATTTGCAGACAAAGGCAAGTCCTTTTTTCCTTTCTCCTT 

TTGGGCTTATCTCTGGCGGGCGCGGCGGAACCTAGAAGCTATTCTGTGGTGGAGGAAACT 

GAGGGCAGCTCCTTTGTCACCAATTTAGCAAAGGACCTGGGTCTGGAGCAGAGGGAATTC 

TCCAGGCGGGGGGTTAGGGTTGTTTCCAGAGGGAACAAACTACATTTGCAGCTCAATCAG 

GAGACCGCGGATTTGTTGCTAAATGAGAAATTGGACCGTGAGGATCTGTGCGGTCACACA 

GAGCCCTGTGTGCTACGTTTCCAAGTGTTGCTAGAGAGTCCCTTCGAGTTTTTTCAAGCT 

GAGCTGCAAGTAATAGACATAAACGACCACTCTCCAGTATTTCTGGACAAACAAATGTTG 

GTGAAAGTATCAGAGAGCAGTCCTCCTGGGACTACGTTTCCTCTGAAGAATGCCGAAGAC 

TTAGATGTAGGC CAAAACAATATTGAGAACTATATAATCAGCC CCAACTCCTATTTTCGG 

GTCCTCACCCGCAAACGCAGTGATGGCAGGAAATACCCAGAGCTGGTGCTGGACAAAGCG 

CTGGACCGAGAGGAAGAAGCTGAGCTCAGGTTAACACTCACAGCACTGGATGGTGGCTCT 

CCGCCCAGATCTGGCACTGCTCAGGTCTACATCGAAGTCCTGGATGTCAACGATAATGCC 

CCTGAATTTGAGCAGCCTTTCTATAGAGTGCAGATCTCTGAGGACAGTCCGGTAGGCTTC 

CTGGTTGTGAAGGTCTCTGCCACGGATGTAGACACAGGAGTCAACGGAGAGATTTCCTAT 

TCACTTTTCCAAGCTTCAGAAGAGATTGGCAAAACCTTTAAGATCAATCCCTTGACAGGA 

GAAATTGAACTAAAAAAACAACTCGATTTCGAAAAACTTCAGTCCTATGAAGTCAATATT 

GAGGCAAGAGATGCTGGAACCTTTTCTGGAAAATGCAC C GTTCTGATTCAAGTGATAGAT 

GTGAACGACCATGCCCCAGAAGTTACCATGTCTGCATTTACCAGCCCAATACCTGAGAAC 

GCGCCTGAAACTGTGGTTGCACTTTTCAGTGTTTCAGATCTTGATTCAGGAGAAAATGGG 

AAAATTAGTTGCTCCATTCAGGAGGATCrACCCTTCCTCCTGAAATCCGCGGAAAACTTT 

TACAC C CTACTAACGGAGAGACCACTAGACAGAGAAAGCAGAGCGGAATACAACATCACT 

ATCACTGTCACTGACTTGGGGACCCCTATGCTGATAACACAGCTCAATATGACCGTGCTG 

ATCGCCGATGTCAATGACAACGCTCCCGCCTTCACCCAAACCTCCTACACCCTGTTCGTC 

CGCGAGAACAACAGCCCCGCCCTGCACATCCGCAGCGTCAGCGCTACAGACAGAGACTCA 

GGCACCAACGCCCAGGTCACCTACTCGCTGCTGCCGCCCCAGGACCCGCACCTGCCCCTC 

ACATCCCTGGTCTCCATCAACGCGGACAACGGCCACCTGTTCGCCCTCAGGTCTCTGGAC 

TACGAGGCCCTGCAGGGGTTCCAGTTCCGCGTGGGCGCTTCAGACCACGGCTCCCCGGCG 

CTGAGCAGCGAGGCGCTGGTGCGCGTGGTGGTGCTGGACGCCAACGACAACTCGCCCTTC 

GTGCTGTACCCGCTGCAGAACGGCTCCGCGCCCTGCACCGAGCTGGTGCCCCGGGCGGCC 

GAGCCGGGCTACCTGGTGACCAAGGTGGTGGCGGTGGACGGCGACTCGGGCCAGAACGCC 

TGGCTGTCGTACGAGCTGCTCAAGGCCACGGAGCTCGGTCTGTTCGGCGTGTGGGCGCAC 

AATGGCGAGGTGCGCACCGCCAGGCTGCTGAGCGAGCGCGACGCGGCCAAGCACAGGCTG 

GTGGTGCTGGTCAAGGACAATGGCGAGCCTCCGCGCTCGGCCACCGCCACGCTGCACGTG 

CTCCTGGTGGACGGCTTCTCCCAGCCCTACCTGCCTCTCCCGGAGGCGGCCCCGACCCAG 

GCCCAGGCCGACTTGCTCACCGTCTACCTGGTGGTGGCGTTGGCCTCGGTGTCTTCGCTC 

TTCCTCTTTTCGGTGCTCCTGTTCGTGGCGGTGCGGCTGTGTAGGAGGAGCAGGGCGGCC 

TCGGTGGGTCGCTGCTTGGTGCCCGAGGGCCCCCTTCCAGGGCATCTTGTGGACATGAGC 

GGCACCAGGACCCTATCCCAGAGCTACCAGTATGAGGTGTGTCTGGCAGGAGGCTCAGGG 

ACCAATGAGTTCAAGTTCCTGAAGCCGATTATCCCCAACTTCCCTCCCCAGTGCCCTGGG 

AAAGAAATACAAGGAAATTCTACCTTCCCCAATAACTTTGGGTTCAATATTCAGTGACCA 

TAGTTGACTTTTACATTCCATAGGTATTTTATTTTGTGGCATTTCCATGCCAATGTTTAT 

TTCCCCCAATTTGTGTGTATGTAATATTGTACGGATTTACTCTTGATTTTTCTCATGTTC 

TTTCTCCCTTTGTTTTAAAGTGAACATTTACCTTTATTCCTGGTTCTT 
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Figure 40 

MEASGKLICRQRQVLFSFLLLGLSLAGAAEPRSYSWEETEGSSFVTNLAKDLGLEQREF 
SRRGVRWSRGNKLHLQLNQETADLLLNEKLDREDLCGKTEPO.TJRFQVXLESPFEFFQA 
ELQVI D I NDHS PVFLDKQML VKVS ESSP PGTTFPL KNAEDLDVGQNNI ENY IIS PNSYFR 
VLTRKRSDGRKYPELVLDK^DREEEAELRLTLTALDGGSPPRSGTAQVYIEVLDVNDNA 
PEFEQPFYRVQISEDSPVGFLWKVSATDVDTGVNGEISYSLFQASEEIGKTFKINPLTG 
EIELKKQLDFEKLQSYEVNIEARDAGTFSGKCTVLIQVIDVNDKAPEVTMSAFTSPIPEN 
APETWALFSVSDLDSGENGKISCSIQEDLPFLLKSAENFYTLLTERPLDRESRAEYNIT 
ITVTDLGTPMLITQLNMTVLIADVNDNAPAFTQTSYTLFVRENNSPALHIRSVSATDRDS 
GTNAQVTYSLLPPQDPHLPLTSLVSINADNGHLFALRSLDYEALQGFQFRVGASDHGSPA 
LSSEALVRVWLDANDNSPFVLYPLQNGSAPCTELVPRAAEPGYLVTKWAVDGDSGQNA 
WLSYQLLKATELGLFGVWAHNGEVRTARLLSERDAAKHRLWLVKDNGEPPRSATATLHV 
LLVDGFSQPYLPLPEAAPTQAQADLLTVYLWALASVSSLFLFSVLLFVAVRLCRRSRAA 

SVGRCLVPEGPLPGHLVDMSGTRTLSQSYQYEVCLAGGSGTNEFKFLKPIIPNFPPQCPG 
KEIQGNSTFPNNFGFNIQ 
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Figure 4 1 



GCTCCCAGCCAAGAACCTCGGGGCCGCTGCGCGGTGGGGAGGAGTTCCCCGAAACCCGGC 
^CGCTAAGCGAGGCCTCCTCCTCCCGCAGATCCGAACGGCCTGGGCGGGGTCACCCCGGCT 
GGGACAAGAAGCCGCCGCCTGCCTGCCCGGGCCCGGGGAGGGGGCTGGGGCTGGGGCCGG- 
AGGCGGGGTGTGAGTGGGTGTGTGCGGGGGGCGGAGGCTTGATGCAATCCCGATAAGAAA 
TGCTCGGGTGTCTTGGGCACCTACCCGTGGGGCCCGTAAGGCGCTACTATATAAGGCTGC 
CGGCCCGGAGCCGCCGCGCCGTCAGAGCAGGAGCGCTGCGTCCAGGATCTAGGGCCACGA 
CCATCCCAACCCGGCACTCACAGCCCCGCAGCGCATCCCGGTCGCCGCCCAGCCTCCCGC 
ACCCCCATCGCCGGAGCTGCGCCGAGAGCCCCAGGGAGGTGCC 

ATGCGGAGCGGGTGTGTGGTGGTCCACGTATGGATCCTGGCCGGCCTCTGGCTGGCCGTG 
GCCGGGCGCCCCCTCGCCTTCTCGGACGCGGGGCCCCACGTGCACTACGGCTGGGGCGAC 
CCCATCCGCCTGCGGCACCTGTACACCTCCGGCCCCCACGGGCTCTCCAGCTGCTTCCTG 
CGCATCCGTGCCGACGGCGTCGTGGACTGCGCGCGGGGCCAGAGCGCGCACAGTTTGCTG 
GAGATCAAGGCAGTCGCTCTGCGGACCGTGGCCATCAAGGGCGTGCACAGCGTGCGGTAC 
CTCTGCATGGGCGCCGACGGCAAGATGCAGGGGCTGCTTCAGTACTCGGAGGAAGACTGT 
GCTTTCGAGGAGGAGATCCGCCCAGATGGCTACAATGTGTACCGATCCGAGAAGCACCGC 
CTCCCGGTCTCCCTGAGCAGTGCCAAACAGCGGCAGCTGTACAAGAACAGAGGCTTTCTT 
CCACTCTCTCATTTCCTGCCCATGCTGCCCATGGTCCCAGAGGAGCCTGAGGACCTCAGG 
GGCCACTTGGAATCTGACATGTTCTCTTCGCCCCTGGAGACCGACAGCATGGACCCATTT 
GGGCTTGTCACCGGACTGGAGGCCGTGAGGAGTCCCAGCTTTGAGAAG TAA CTGAGACCA 
TGCCCGGGCCTCTTCACTGCTGCCAGGGGCTGTGGTACCTGCAGCGTGGGGGACGTGCTT 
CTACAAGAACAGTCCTGAGTCCACGTTCTGTTTAGCTTTAGGAAGAAACATCTAGAAGTT 
GTACATATTCAGAGTTTTCCATTGGCAGTG CCAGTTTCTAGCCAATAGACTTGTCTGATC 
ATAACATTGTAAGCCT 

GTAGCTTCCCCAGCTGCTGCCTGGGCCCCCATTCTGCTCCCTCGAGGTTGCTGGACAAGC 
TGCTGCACTGTCTCAGTTCTGCTTGAATACCTCCATCGATGGGGAACTCACTTCCTTTGG 
AAAAATTCTTATGTCAAGCTGAAATTCTCTAATTTTTTCTCATCACTTCCCCAGGAGCAG 
CCAGAAGACAGGCAGTAGTTTTAATTTCAGGAACAGGTGATCCACTCTGTAAAACAGCAG 
GTAAATTTCACTCAACCCCATGTGGGAATTGATCTATATCTCTACTTCCAGGGAC CATTT 
GCCCTTCCCAAATCCCTCCAGGCCAGAACTGACTGGAGCAGGCATGGCCCACCAGGCTTC 
AGGAGTAGGGGAAGCCTGGAGCCCCACTCCAGCCCTGGGACAACTTGAGAATTCCCCCTG 
AGGCCAGTTCTGTCATGGATGCTGTCCTGAGAATAACTTGCTGTCCCGGTGTCACCTGCT 
TCCATCTCCCAGCCCACCAGCCCTCTGCCCACCTCACATGCCTCCCCATGGATTGGGGCC 
TCCCAGGCCCCCCACCTTATGTCAACCTGCACTTCTTGTTCAAAAATCAGGAAAAGAAAA 
GATTTGAAGACCCCAAGTCTTGTCAATAACTTGCTGTGTGGAAGCAGCGGGGGAAGACCT 
AGAACCCTTTCCCCAGCACTTGGTTTTCCAACATGATATTTATGAGTAATTTATTTTGAT 
ATGTACATCTCTTATTTTCTTACATTATTTATGCCCCCAAATTATATTTATGTATGTAAG 
TGAGGTTTGTTTTGTATATTAAAATGGAGTTTGTTTGT 
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Figure 42 

MRSGCVVVHVWILAGLWLAVAGRPLAFSDAGPHVHYGWGDPIRLRHLYTSGPHGLSSCF 
LRIRADGWDCARGQSAHSLLEIKAVALRT/AIKGVHSVRYLCMGADGKMQGLLQYSEE 
DCAFEEEIRPDGYNVYRSEKHRLPVSLSSAKQRQLYKNRGFLPLSHFLPMLPMVPEEPE 
DLRGHLESDMFSSPLETDSMDPFGLVTGLEAVRSPSFEK 



43 / 133 



WO 00/53758 



PCT/US00/05841 



Figure 43 

GGTCTCGCTCTGTCACACAGGCTGGAGTGCAGTGGTGTGATCTTGGCTCATCGTAACCTC 
CACCTCCCGGGTTCAAGTGATTCTCATGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGT 
GGTGACTTCCAAGAGTGACTCCGTCGGAGGAAA 

ATGACTCCCCAGTCGCTGCTGCAGACGACACTGTTCCTGCTGAGTCTGCTCTTCCTGGTC 

CAAGGTGCCCACGGCAGGGGCCACAGGGAAGACTTTCGCTTCTGCAGCCAGCGGAACCAG 

ACACACAGGAGCAGCCTCCACTACAAACCCACACCAGACCTGCGCATCTCCATCGAGAAC 

TCCGAAGAGGCCCTCACAGTCCATGCCCCTTTCCCTGCAGCCCACCCTGCTTCCCGATCC 

TTCCCTGACCCCAGGGGCCTCTACCACTTCTGCCTCTACTGGAACCGACATGCTGGGAGA 

TTACATCTTCTCTATGGCAAGCGTGACTTCTTGCTGAGTGACAAAGCCTCTAGCCTCCTC 

TGCTTCCAGCACCAGGAGGAGAGCCTGGCTCAGGGCCCCCCGCTGTTAGCCACTTCTGTC 

ACCTCCTGGTGGAGCCCTCAGAACATCAGCCTGCCCAGTGCCGCCAGCTTCACCTTCTCC 

TTCCACAGTCCTCCCCACACGGCCGCTCACAATGCCTCGGTGGACATGTGCGAGCTCAAA 

AGGGACCTCCAGCTGCTCAGCCAGTTCCTGAAGCATCCCCAGAAGGCCTCAAGGAGGCCC 

TCGGCTGCCCCCGCCAGCCAGCAGTTGCAGAGCCTGGAGTCGAAACTGACCTCTGTGAGA 

TTCATGGGGGACATGGTGTCCTTCGAGGAGGACCGGATCAACGCCACGGTGTGGAAGCTC 

CAGCCCACAGCCGGCCTCCAGGACCTGCACATCCACTCCCGGCAGGAGGAGGAGCAGAGC 

GAGATCATGGAGTACTCGGTGCTGCTGCCTCGAACACTCTTCCAGAGGACGAAAGGCCGG 

AGCGGGGAGGCTGAGAAGAGACTCCTCCTGGTGGACTTCAGCAGCCAAGCCCTGTTCCAG 

GACAAGAATTCCAGCCAAGTCCTGGGTGAGAAGGTCT7GGGGATTGTGGTACAGAACACC 

AAAGTAGCCAACCTCACGGAGCCCGTGGTGCTCACTTTCCAGCACCAGCTACAGCCGAAG 

AATGTGACTCTGCAATGTGTGTTCTGGGTTGAAGACCCCACATTGAGCAGCCCGGGGCAT 

TGGAGCAGTGCTGGGTGTGAGACCGTCAGGAGAGAAACCCAAACATCCTGCTTCTGCAAC 

CACTTGACCTACTTTGCAGTGCTGATGGTCTCCTCGGTGGAGGTGGACGCCGTGCACAAG 

CACTACCTGAGCCTCCTCTCCTACGTGGGCTGTGTCGTCTCTGCCCTGGCCTGCCTTGTC 

ACCATTGCCGCCTACCTCTGCTCCAGGGTGCCCCTGCCGTGCAGGAGGAAACCTCGGGAC 

TACACCATCAAGGTGCACATGAACCTGCTGCTGGCCGTCTTCCTGCTGGACACGAGCTTC 

CTGCTCAGCGAGCCGGTGGCCCTGACAGGCTCTGAGGCTGGCTGCCGAGCCAGTGCCATC 

TTCCTGCACTTCTCCCTGCTCACCTGCCTTTCCTGGATGGGCCTCGAGGGGTACAACCTC 

TACCGACTCGTGGTGGAGGTCTTTGGCACCTATGTCCCTGGCTACCTACTCAAGCTGAGC 

GCCATGGGCTGGGGCTTCCCCATCTTTCTGGTGACGCTGGTGGCCCTGGTGGATGTGGAC 

AACTATGGC CCCATCATCTTGGCTGTGCATAGGACTCCAGAGGG CGTCATCTACCCTTCC 

ATGTGCTGGATCCGGGACTCCCTGGTCAGCTACATCACCAACCTGGGCCTCTTCAGCCTG 

GTGTTTCTGTTCAACATGGCCATGCTAGCCACCATGGTGGTGCAGATCCTGCGGCTGCGC 

CCCCACACCCAAAAGTGGTCACATGTGCTGACACTGCTGGGCCTCAGCCTGGTCCTTGGC 

CTGCCCTGGGCCTTGATCTTCTTCTCCTTTGCTTCTGGCACCTTCCAGCTTGTCGTCCTC 

TACCTTTTCAGCATCATCACCTCCTTCCAAGGCTTCCTCATCTTCATCTGGTACTGGTCC 

ATGCGGCTGCAGGCCCGGGGTGGCCCCTCCCCTCTGAAGAGCAACTCAGACAGCGCCAGG 

CTCCCCATCAGCTCGGGCAGCACCTCGTCCAGCCGCATCTAGGCCTCCAGCCCACCTGCC 

CATGTGATGAAGCAGAGATGCGGCCTCGTCGCACACTGCCTGTGGCCCCCGAGCCAGGCC 

CAGCC CCAGGCCAGTCAGCCGCAGACTTTGGAAAG CCCAACGACCATGGAGAGATGGGCC 

GTTGCCATGGTGGACGGACTCCCGGGCTGGGCTTTTGAATTGGCCTTGGGGACTACTCGG 

CTCTCACTCAGCTCCCACGGGACTCAGAAGTGCGCCGCCATGCTGCCTAGGGTACTGTCC 

CCACATCTGTCCCAACCCAGCTGGAGGCCTGGTCTCTCCTTACAACCCCTGGGCCCAGCC 

CTCATTGCTGGGGGCCAGGCCTTGGATCTTGAGGGTCTGGCACATCCTTAATCCTGTGCC 

CCTGCCTGGGACAGAAATGTGGCTCCAGTTGCTCTGTCTCTCGTGGTCACCCTGAGGGCA 

CTCTGCATCCTCTGTCATTTTAACCTCAGGTGGCACCCAGGGCGAATGGGGCCCAGGGCA 

GACCTTCAGGGCCAGAGCCCTGGCGGAGGAGAGGCCCTTTGCCAGGAGCACAGCAGCAGC 

TCGCCTACCTCTGAGCCCAGGCCCCCTCCCTCCCTCAGCCCCCCAGTCCTCCCTCCATCT 

TCCCTGGGGTTCTCCTCCTCTCCCAGGGCCTCCTTGCTCCTTCGTTCACAGCTGGGGGTC 

CCCGATTCCAATGCTGTTTTTTGGGGAGTGGTTTCCAGGAGCTGCCTGGTGTCTGCTGTA 

AATGTTTGTCTACTGCACAAGCCTCGGCCTGCCCCTGAGCCAGGCTCGGTACCGATGCGT 

GGGCTGGGCTAGGTCCCTCTGTCCATCTGGGCCTTTGTATGAGCTGCATTGCCCTTGCTC 

ACCCTGACCAAGCACACGCCTCAGAGGGGCCCTCAGCCTCTCCTGAAGCCCTCTTGTGGC 

AAGAACTGTGGACCATGCCAGTCCCGTCTGGTTTCCATCCCACCACTCCAAGGACTGAGA 

CTGACCTCCTCTGGTGACACTGGCCTAGAGCCTGACACTCTCCTAAGAGGTTCTCTCCAA 
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GCCCCCAAATAGCTCCAGGCGCCCTCGGCCGCCCATCATGGTTAATTCTGTCCAACAAAC 

ACACACGGGTAGATTGCTGGCCTGTTGTAGGTGGTAGGGACACAGATGACCGACCTGGTC 

ACTCCTCCTGCCAACATTCAGTCTGGTATGTGAGGCGTGCGTGAAGCAAGAACTCCTGGA 

GCTACAGGGACAGGGAGCCATCATTCCTGCCTGGGAATCCTGGAAGACTTCCTGCAGGAG 

TCAGCGTTCAATCTTGACCTTGAAGATGGGAAGGATGTTCTTTTTACGTACCAATTCTTT 

TGTCTTTTGATATTAAAAAGAAGTACATGTTCATTGTAGAGAATTTGGAAACTGTAGAAG 

AGAATCAAGAAGAAAAATAAAAATCAGCTGTTGTAATCG C CTAGCAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAA 
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Figure 44 

MTPQSLLQTTLFLLSLLFLVQGAHGRGHREDFRFCSQRNQTHRSSLHYKPTPDLRISIEN 
SEEALTVHAPFPAAHPASRSFPDPRGLYHFCLYWNRHAGRLHLLYGKRDFLLSDKASSLL 
CFQHQEESLAQGPPLIATSVTSWWSPQNISLPSAASFTFSFHSPPHTAAHNASVDMCELK 
RDLQLLSQFLKHPQKASRRPSAAPASQQLQSLESKLTSVRFMGDMVSFEEDRINATVWKL 
QPTAGLQDLHIHSRQEEEQSEIMEYSVLLPRTLFQRTKGRSGEAEKRLLLVDFSSQALFQ 
DKNS SQ VLGEKVLG I WQNTKVANLTEP WLTFQHQLQ PKNVTLQCVFW VEDPTLSS PGH 
WSSAGCETVRRETQTSCFCNHLTYFAVLMVSSVEVDAVHKHYLSLLSYVGCVVSALACLV 
TIAAYLCSRVPLPCRRKPRDYTIKVHMNLLLAVFLLDTSFLLSEPVALTGSEAGCRASAI 
FLHFSLLTCLSWMGLEGYNLYRLWEVFGTYVPGYLLKLSAMGWGFPIFLVTLVALVDVD 
NYGPIILAVHRTPEGVIYPSMCWIRDSLVSYITNLGLFSLVFLFNMAMIATMVVQILRLR 
PHTQKWSHVLTLLGLSLVLGLPWALIFFSFASGTFQLWLYLFSIITSFQGFLIFIWYWS 
MRLQARGGPSPLKSNSDSARLPISSGSTSSSRI 
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Figure 45 

GCGAGGTGGCGATCGCTGAGAGGCAGGAGGGCCGAGGCGGGCCTGGGAGGCGGCCCGGAG 

GTGGGGCGCCGCTGGGGCCGGCCCGCACGGGCTTCATCTGAGGGCGCACGGCCCGCGACC 

GAGCGTGCGGACTGGCCTCCCAAGCGTGGGGCGACAAGCTGCCGGAGCTGCA 

ATGGGCCGCGGCTGGGGATTCTTGTTTGGCCTCCTGGGCGCCGTGTGGCTGCTCAGCTCG 

GGCCACGGAGAGGAGCAGCCCCCGGAGACAGCGGCACAGAGGTGCTTCTGCCAGGTTAGT 

GGTTACTTGGATGATTGTACCTGTGATGTTGAAACCATTGATAGATTTAATAACTACAGG 

CTTTTCCCAAGACTACAAAAACTTCTTGAAAGTGACTACTTTAGGTATTACAAGGTAAAC 

CTGAAGAGGCCGTGTCCTTTCTGGAATGACATCAGCCAGTGTGGAAGAAGGGACTGTGCT 

GTCAAACCATGTCAATCTGATGAAGTTCCTGATGGAATTAAATCTGCGAGCTACAAGTAT 

TCTGAAGAAGCCAATAATCTCATTGAAGAATGTGAACAAGCTGAACGACTTGGAGCAGTG 

GATGAATCTCTGAGTGAGGAAACACAGAAGGCTGTTCTTCAGTGGACCAAGCATGATGAT 

TCTTCAGATAACTTCTGTGAAGCTGATGACATT CAGTCCCCTGAAGCTGAATATGTAGAT 

TTGCTTCTTAATCCTGAGCGCTACACTGGTTACAAGGGACCAGATGCTTGGAAAATATGG 

AATGTCATCTACGAAGAAAACTGTTTTAAGCCACAGACAATTAAAAGACCTTTAAATCCT 

TTGG CTTCTGGTCAAGGGACAAGTGAAGAGAACACTTTTTACAGTTGG CTAGAAGGTCTC 

TGTGTAGAAAAAAGAGCATTCTACAGACTTATATCTGGCCTACATGCAAGCATTAATGTG 

CATTTGAGTGCAAGATATCTTTTACAAGAGACCTGGTTAGAAAAGAAATGGGGACACAAC 

ATTACAGAATTTCAACAGCGATTTGATGGAATTTTGACTGAAGGAGAAGGTCCAAGAAGG 

CTTAAGAACTTGTATTTTCTCTACTTAATAGAACTAAGGGCTTTATCCAAAGTGTTACCA 

TTCTTCGAGCGCCCAGATTTTCAACTCTTTACTGGAAATAAAATTCAGGATGAGGAAAAC 

AAAATGTTACTTCTGGAAATACTTCATGAAATCAAGTCATTTCCTTTGCATTTTGATGAG 

AATTCATTTTTTGCTGGGGATAAAAAAGAAGCACACAAACTAAAGGAGGACTTTCGACTG 

CATTTTAGAAATATTTCAAGAATTATGGATTGTGTTGGTTGTTTTAAATGTCGTCTGTGG 

GGAAAGCTTCAGACTCAGGGTTTGGGCACTGCTCTGAAGATCTTATTTTCTGAGAAATTG 

ATAGCAAATATGCCAGAAAGTGGACCTAGTTATGAATTCCATCTAACCAGACAAGAAATA 

GTATCATTATTCAACGCATTTGGAAGAATTTCTACAAGTGTGAAAGAATTAGAAAACTTC 

AGGAACTTGTTACAGAATATTCA TTAAA GAAAACAAGCTGATATGTGCCTGTTTCTGGAC 

AATGGAGGCGAAAGAGTGGAATTTCATTCAAAGGCATAATAGCAATGACAGTCTTAAGCC 

AAACATTTTATATAAAGTTGCTTTTGTAAAGGAGAATTATATTGTTTTAAGTAAACACAT 

TTTTAAAAATTGTGTTAAGTCTATGTATAATACTACTGTGAGTAAAAGTAATACTTTAAT 

AATGTGGTACAAATTTTAAAGTTTAATATTGAATAAAAGGAGGATTATCAAATTAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Figure 4 6 

MGRGWGFLFGLLGAVWLLSSGHGEEQPPETAAQRCFCQVSGYLDDCTCDVETIDRFNNYR 
LFPRLQKLLESDYFRYYKVNLKRPCPFWNDISQCGRRDCAVKPCQSDEVPDGIKSASYKy 
S EEANNL I EE CEQAERLGAVDES LS EETQKAVLQWTKHDDS S DNFCEADD I QS PEAEYVD 
LLLNPERYTGYKGPDAWKI WNVI YEENCFKPQTI KRPLNPLASGQGTSEENTFYSWLEGL 
CVEKRAFYRL I SGLHAS INVHLS ARYLLQETWLEKKWGHNI TEFQQRFDG ILTEGEGPRR 
LKNLYFLYLIELRALSKVLPFFERPDFQLFTGNKIQDEENKMLLLEILHEIKSFPLHFDE 
NSFFAGDKKEAHKLKEDFRLHFRNISRIMDCVGCFKCRLWGKLQTQGLGTALKILFSEKL 
IANMPESGPSYEFHLTRQEIVSLFNAFGRISTSVKELENFRNLLQNIH 
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Figure 47 

GCCACGTTGTCTTCTTTCCTTCACCACCACCCAGGAGCTCAGAGATCTAAGCTGCTTTCC 
ATCTTTTCTCCGAGCCCCAGGACACTGACTCTGTACAGG 

ATGGGGCCGTCCTCTTGCCTCCTTCTCATCCTAATCCCCCTTCTCCAGCTGATCAACCGG 
GGGAGTACTCAGTGTTCCTTAGACTCCGTTATGGATAAGAAGATCAAGGATGTTCTCAAC 
AGTCTAGAGTACAGTCCCTCTCCTATAAGCAAGAAGCTCTCGTGTGCTAGTGTCAAAAGC 
CAAGGCAGACCGTCCTCCTGCCCTGCTGGGATGGCTGTCACTGGCTGTGCTTGTGGCTAT 
GGCTGTGGTTCGTGGGATGTTCAGCTGGAAACCACCTGCCACTGCCAGTGCAGTGTGGTG 
GACTGGACCACTGCCCGCTGCTGCCACCTGACCTGACAGGGAGGAGGCTGAGAACTCAGT 
TTTGTGACCATGACAGTAATGAAACCAGGGTCCCAACCAAGAAATCTAACTCAAACGTCC 
CACTTCATTTGTTCCATTCCTGATTCTTGGGTAATAAAGACAAACTTTGTACCTCAAAAA 
AAAAAAAAAAAAAAAA 
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Figure 48 

MGPSSCLLLILIPLLQLINPGSTQCSLDSVMDKKIKDVLNSLEYSPSPISPCKLSCASVKS 
QGRPSSCPAGMAVTGCACGYGCGSWDVQLETTCHCQCSVVDWTTARCCHLT 
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Figure 4S 

GGCACGAGGGGGACAGGAGCTAATACCCAGAACTGAGTTGTGTCCTGCTAAGTCCTCTGC 
CACGTACCCACGGG 

ATGAAGAACCTTTCATTTCCCCTCCTTTTCCTTTTCTTCCTTGTCCCTGAACTGCTGGGC 

TCCAGCATGCCACTGTGTCCCATCGATGAAGCCATCGACAAGAAGATCAAACAAGACTTC 

AACTCCCTGTTTCCAAATGCAATAAAGAACATTGGCTTAAATTGCTGGACAGTCTCCTCC 

AGAGGGAAGTTGGCCTCCTGCCCAGAAGGCACAGCAGTCTTGAGCTGCTCCTGTGGCTCT 

GCCTGTGGCTCGTGGGACATTCGTGAAGAAAAAGTGTGTCACTGCCAGTGTGCAAGGATA 

GACTGGACAGCAGCCCGCTGCTGTAAGCTGCAGGTCGCTTCCTGATGTCGGGGAAGTGAG 

CGTGGTTTCCAGCACAGCCACCCGTTCCTGTAGCTCCAGAGATGTCTGATGTCCTCCGGT 

CTCTACAGGCACCTGCACTCACGTGCGCGAATCCACACACAAGCACACATACTTAAAAAT ' 

AAAACAAAACAGGCTGGAAAAAAAAAAAAAAAAAAAAAAA 
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Figure 50 

MKNLSFPLLFLFFLVPELLGS3MPLCPIDEAIDKKIKQDFNSLFPNAIKNIGLNCWTVS 
SRGKLASCPSGTAVLSCSCGSACGSWDIREEKVCKCQCARIDWTAARCCKLQVAS 
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Figure SI 

CCAGTCTGTCGCCACCTCACTTGGTGTCTGCTGTCCCCGCCAGGCAAGCCTGGGGTGAGA 
GCACAGAGGAGTGGGCCGGGACC 

ATGCGGGGGACGCGGCTGGCGCTCCTGGCGCTGGTGCTGGCTGCCTGCGGAGAGCTGGCG 
CCGGCCCTGCGCTGCTACGTCTGTCCGGAGCCCACAGGAGTGTCGGACTGTGTCACCATC 
GCCACCTGCACCACCAACGAAACCATGTGCAAGACCACACTCTACTCCCGGGAGATAGTG 
TACCCCTTCCAGGGGGACTCCACGGTGACCAAGTCCTGTGCCAGCAAGTGTAAGCCCTCG 
GATGTGGATGGCATCGGCCAGACCCTGCCCGTGTCCTGCTGCAATACTGAGCTGTGCAAT 
GTAGACGGGGCGCCCGCTCTGAACAGCCTCCACTGCGGGGCCCTCACGCTCCTCCCACTC 
TTGAGCCTCCGACTGTAGAGTCCCCGCCCACCCCCATGGCCCTATGCGGCCCAGCCCCGA 
ATGCCTTGAAGAAGTGCCCCCTGCACCAGGAAAAAAAAAAAAAAAAA 
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Figure 52 

mrgtrlall^vlaacgelapalrc^ 
ypf<^dstvtkscaskckpsdvdgigqtlpvsccnt^ 

LSLRL 
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Figure 53 

AAAGTTACATTTTCTCTGGAACTCTCCTAGGCCACTCCCTGCTGATGCAACATCTGGGTT 
TGGGCAGAAAGGAGGGTGCT7CGGAGCCCGCCCTTTCTGAGCTTCCTGGGCCGGCTCTAG 
AACAATTCAGGCTTCGCTGCGACTCAGACCTCAGCTCCAACATATGCATTCTGAAGAAAG 
ATGGCTGAGATGGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGGTCAAACTGA 
GTCTACCAA 

ATGCAGACTTT CACAATGGTTCTAGAAGAAAT CTGGACAAGTCTTTT CATGTGGTTTTT C 
TACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGCCTGCCCCTCAGAAC 
CTCTCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCAGTGATCGCGCCT 
GGAGAAACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCTGTACACGAGC 
CACATCTGGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTGATGTCACT 
GATGACATCACGGCCACTGTGCCATACAACCTTCGTGTCAGGGCCACATTGGGCTCACAG 
ACCTCAGCCTGGAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTACCCGA 
CCTGGGATGGAGATCACCAAAGATGGCTTCCACCTGGTTATTGAGCTGGAGGACCTGGGG 
CCCCAGTTTGAGTTCCTTGTGGCCTACTGGAGGAGGGAGCCTGGTGCCGAGGAACATGTC 
AAAATGGTGAGGAGTGGGGGTATTCCAGTGCACCTAGAAACCATGGAGCCAGGGGCTGCA 
TACTGTGTGAAGGCCCAGACATTCGTGAAGGCCATTGGGAGGTACAGCGCCTTCAGCCAG 
ACAGAATGTGTGGAGGTGCAAGGAGAGGCCATTCCCCTGGTACTGGCCCTGTTTGCCTTT 
GTTGGCTTCATGCTGATCCTTGTGGTCGTGCCACTGTTCGTCTGGAAAATGGGCCGGCTG 
CTCCAGTACTCCTGTTGCCCCGTGGTGGTCCTCCCAGACACCTTGAAAATAACCAATTCA 
CCCCAGAAGTTAATCAGCTGCAGAAGGGAGGAGGTGGATGCCTGTGCCACGGCTGTGATG 
TCTCCTGAGGAACTCCTCAGGGCCTGGATCTCA TAG GTTTGCGGAAGGGCCCAGGTGAAG 
CCGAGAACC7GGTCTGCATGACATGGAAAC 

CATGAGGGGACAAGTTGTGTTTCTGTTTTCCGCCACGGACAAGGGATGAGAGAAGTAGGA 
AGAGCCTGTTGTCTACAAGTCTAGAAGCAACCATCAGAGGCAGGGTGGTTTGTCTAACAG 
AACACTGACTGAGGCTTAGGGGATGTGACCTCTAGACTGGGGGCTGCCACTTGCTGGCTG 
AGCAACCCTGGGAAAAGTGACTTCATCCCTTCGGTCCTAAGTTTTCTCATCTGTAATGGG 
GGAATTACCTACACACCTGCTAAACACACACACACAGAGTCTCTCTCTATATATACACAC 
GTACACATAAATACACCCAGCACTTGCAAGGCTAGAGGGAAACTGGTGACACTCTACAGT 
CTGACTGATTCAGTGTTTCTGGAGAGCAGGACATAAATGTATGATGAGAATGATCAAGGA 
CTCTACACACTGGGTGGCTTGGAGAGCCCACTTTCCCAGAATAATCCTTGAGAGAAAAGG 
AATCATGGGAGCAATGGTGTTGAGTTCACTTCAAGCCCAATGCCGGTGCAGAGGGGAATG 
GCTTAGCGAGCTCTACAGTAGGTGACCTGGAGGAAGGTCACAGCCACACTGAAAATGGGA 
TGTGCATGAACACGGAGGATCCATGAACTACTGTAAAGTGTTGACAGTGTGTGCACACTG 
CAGACAGCAGGTGAAATGTATGTGTGCAATGCGACGAGAATGCAGAAGTCAGTAACATGT 
GCATG7TTGTTGTGCTCCTTTTTTCTGTTGGTAAAGTACAGAATTCAGCAAATAAAAAGG 
GCCACCCTGGCCAAAAGCGGTAAAAAAAAAAAAAAAA 
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Figure 54 

MQTFTMVLEEIWTSLFMWFFYALIPCLLTDEVAILPAPQNLSVLSTNMKHLLMWSPVIAP 
GETVYYSVEYQGEYESLYTSHIWIPSSWCSLTEGPECDVTDDITATVPYNLRVRATLGSQ 
TSAWSILKHPFNRNSTILTRPGMEITKDGFHLVIELEDLGPQFEFLVAYWRREPGAEEHV 
KMVRSGGI PVHLETMEPGAAYCVKAQTFVKAIGRYSAFSQTECVEVQGEAI PLVLALFAF 
VGFMLILVWPLFWKMGRLLQYSCCPVVVLPDTLKITNSPQKLISCRREEVDACATAVM 
SPEELLRAWIS 
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Figure 55 

GAGCAGGACGGAGCC 

ATGGACCCCGCCAGGAAAGCAGGTGCCCAGGCCATGATCTGGACTGCAGGCTGGCTGCTG 
CTGCTGCTGCTTCGCGGAGGAGCGCAGGCCCTGGAGTGCTACAGCTGCGTGCAGAAAGCA 
GATGACGGATGCTCCCCGAACAAGATGAAGACAGTGAAGTGCGCGCCGGGCGTGGACGTC 
TGCACCGAGGeCGTGGGGGCGGTGGAGACCATCCACGGACAATTCTCGCTGGCAGTGCGG 
GGTTGCGGTTCGGGACTCCCCGGCAAGAATGACCGCGGCCTGGATCTTCACGGGCTTCTG 
GCGTTCATCCAGCTGCAGCAATGCGCTCAGGATCGCTGCAACGCCAAGCTCAACCTCACC 
TCGCGGGCGCTCGACCCGGCAGGTAATGAGAGTGCATACCCGCCCAACGGCGTGGAGTGC 
TACAGCTGTGTGGGCCTGAGCCGGGAGGCGTGCCAGGGTACATCGCCGCCGGTCGTGAGC 
TGCTACAACGCCAGCGATCATGTCTACAAGGGCTGCTTCGACGGCAACGTCACCTTGACG 
GCAGCTAATGTGACTGTGTCCTTGCCTGTCCGGGGCTGTGTCCAGGATGAATTCTGCACT 
CGGGATGGAGTAACAGGCCCAGGGTTCACGCTCAGTGGCTCCTGTTGCCAGGGGTCCCGC 
TGTAACTCTGACCTCCGCAACAAGACCTACTTCTCCCCTCGAATCCCACCCCTTGTCCGG 
CTGCCCCCTCCAGAGCCCACGACTGTGGCGTCAACCACATCTGTCACCACTTCTACCTCG 
GCCCCAGTGAGACCCACATCCACCACCAAACCCATGCCAGCGCCAACCAGTCAGACTCCG 
AGACAGGGAGTAGAACACGAGGCCTCCCGGGATGAGGAGCCCAGGTTGACTGGAGGCGCC 
GCTGGCCACCAGGACCGCAGCAATTCAGGGCAGTATCCTGCAAAAGGGGGGCCCGAGCAG 
CCCCATAATAAAGGCTGTGTGGCTCCCACAGCTGGATTGGCAGCCCTTCTGTTGGCCGTG 
GCTGCTGGTGTCCTACTGTGAGCTTCTCCACCTGGAAATTTCCCTCTCACCTACTTCTCT 
GGCCCTGGGTACCCCTCTTCTCATCACTTCCTGTTCCCACCACTGGACTGGGCTGGCCCA 
GCCCCTGTTTTTCCAACATTCCCCAGTATCCCCAGCTTCTGCTGCGCTGGTTTGCGGCTT 
TGGGAAATAAAATACCGTTGTATATATTCTGCCAGGGGTGTTCTAGCTTTTTGAGGACAG 
CTCCTGTATCCTTCTCATCCTTGTCTCTCCGCTTGTCCTCTTGTGATGTTAGGACAGAGT 
GAGAGAAGTCAGCTGTCACGGGGAAGGTGAGAGAGAGGATGCTAAGCTTCCTACTCACTT 
TCTCCTAGCCAGCCTGGACTTTGGAGCGTGGGGTGGGTGGGACAATGGCTCCCCACTCTA 
AGCACTGCCTCCCCTACTCCCCGCATCTTTGGGGAATCGGTTCCCCA.TATGTCTTCCTTA 
CTAGACTGTGAGCTCCTCGAGGGGGGGCCCGGTACCCAATTCGCCCTATAGTGAGTCGTA 
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Figure 56 

MDPARKAGAQAMI WTAGWLLLLLLRGGAQALECYS CVQKADDGCS PNKMKTVKCAPGVDV 
CTEAVGAVETIHGQFSI*AVRGCGSGLPGKNDRGLDLHGLLAFIQLQQCAQDRCNAKLNLT 
SRALDPAGNESAYPPNGVECYSCVGLSREACQGTSPPWSCYNASDHVYKGCFDGNVTLT 
AANVTVSLPVRGCVQDEFCTRDGVTGPGFTLSGSCCQGSRCNSDLRNKTYFSPRIPPLVR 
LPPPEPTTVASTTSVTTSTSAPVRPTSTTKPMPAPTSQTPRQGVEHEASRDEEPRLTGGA 
AGHQDRSNSGQYPAKGGPQQPHNKGCVAPTAGLAALLLAVAAGVLL 
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Figure 57 

CGGCCACAGCTGGCATGCTCTGCCTGATCGCCATCCTGCTGTATGTCCTCGTCCAGTACC 
TCGTGAACCCCGGGGTGCTCCGCACGGACCCCAGATGTCAAGAAT 

ATGAACACGTGGCTGCTGTTCCTCCCCCTGTTCCCGGTGCAGGTGCAGACCCTGATAGTC 

GTGATCATCGGGATGCTCGTGCTCCTGCTGGACTTTCTTGGCTTGGTGCACCTGGGCCAG 

CTGCTCATCTTCCACATCTACCTGAGTATGTCCCCCACCCTAAGCCCCCGATCCCCCCAA 

GGCTGGGTGGTCAGAGCTGCTCATCTTACACCTCTACTTGAGTATGTCCCTAACCCTGAG 

CCCCCCACGCCTGGGGCCAGAGTCTTTGTCCCCCGTGTGCGCATGTGTTCAGGGTCAGCC 

TCTCCCAGAAGTGAGATCATGGACAAAAAGGGCAAATCACAGGAAGAAATTAAATCCATG 

AGGACCCAGCAGGCCCAGCAAGAAGCTGAACTCACGCCGAGACCTGCAGGAGTGGTGCCA 

GGTGCTTGAAGTAACAAGTTTAAAATGTTCAGAGACAATGGAATGGAATCTATTAGGCAA 

GAACAGGACA.TTATGAAATAAGGACAGGTGGACTTCCAAAAACACAAGTAGAAATTCTAA 

CAATGAAATATATTACAGGCAGGTCACCCACTAACCAAACAACTGAAGCGAGAGCTGGTG 

GTCTTGCTTGGTCTCACAGTGGGCACAGCGGTAGGCGGTCAGTCATGTTGCTGAACGACG 

GAGGGTAAACTCCCCAGCCCCAAGAAAACCTGTGTTGGAAGTAACAACAACCTCCCTGCT 

CCTGGCACCAGCCGTTTTGGTCATGGTGGGCCAGCTGCAAAGCGTCTTCCATTCTCTGGG 

CAGTGGTGGCCCCGAGGCTGTGGCCTCTCAGGGGGTTTCTGTGGACACGGGCAGCAGAGT 

GTGTCCAGGCCAGCCCCCAAGAATGCCCTGCTCCTGACAGCTTGGCCAACCCCTGGTCAG 

GGCAGAGGGAGTTGGGTGGGTCAGGCTCTGGGCTCACCTCCATCTCCAGAGCATCCCCTG 

CCTGCAGTTGTGGCAAGAACGCCCAGCTCAGAATGAACACACCCCACCAAGAGCCTCCTT 

GTTCATAACCACAGGTTACCCTACAAACCACTGTCCCCACACAACCCTGGGGATGTTTTA 

AAACACACACCTCTAACGCATATCTTACAGTCACTGTTGTCTTGCCTGAGGGTTGAATTt 

TTTTTAATGAAAGTGCAATGAAAATCACTGGATTAAATCCTACGGACACAGAGCTGAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Figure 58 

MNTWLLFLPLFPVQVQTLI WI IGMLVLLLDFLGLVHLGQLLI FHI YLSMSPTLSPRSPQ 
GWWRAAHLTPLLEYVPNPEPPTPGARVFVPRVRMCSGSASPRSEIMDKKGKSQEEIKSM 
RTQQAQQEAELTPRPAGWPGA 
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Figure 5 9 

AGGCGGGCAGCAGCTGCAGGCTGACCTTGCAGCTTGGCGGA 

ATGGACTGGCCTCACAACCTGCTGTTTCTTCTTACCATTTCCATCTTCCTGGGGCTGGGC 
CAGCCCAGGAGCCCCAAAAGCAAGAGGAAGGGGCAAGGGCGGCCTGGGCCCCTGGCCCCT 
GGCCCTCACCAGGTGCCACTGGACCTGGTGTCACGGATGAAACCGTATGCCCGCATGGAG 
GAGTATGAGAGGAACATCGAGGAGATGGTGGCCCAGCTGAGGAACAGCTCAGAGCTGGCC 
CAGAGAAAGTGTGAGGTCAACTTGCAGCTGTGGATGTCCAACAAGAGGAGCCTGTCTCCC 
TGGGGCTACAGCATCAACCACGACCCCAGCCGTATCCCCGTGGACCTGCCGGAGGCACGG 
TGCCTGTGTCTGGGCTGTGTGAACCCCTTCACCATGCAGGAGGACCGCAGCATGGTGAGC 
GTGCCGGTGTTCAGCCAGGTTCCTGTGCGCCGCCGCCTCTGCCCGCCACCGCCCCGCACA 
GGGCCTTGCCGCCAGCGCGCAGTCATGGAGACCATCGCTGTGGGCTGCACCTGCATCTTC 
T6AAT.CACCTGGCCCAGAAGCCAGGCCAGCAGCCCGAGACCATCCTCCTTGCACCTTTGT 
GCCAAGAAAGGCCTATGAAAAGTAAACACTGACTTTTGAAAGCAAG 
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Figure 60 

MDWPHNLLFLLTISIFLGLGQPRSPKSKRKGQGRPGPLAPGPHQVPLDLVSRMKPYARME 
EYERNIEEMVAQLRNSSELAQRKCEVNLQLWMSNKRSLSPWGYS INHDPSRI PVDLPEAR 
CLCLGCVNPFTMQEDRSMVSVPVFSQVPVRRRLCPPPPRTGPCRQRAVMETIAVGCTCIF 
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Figure 61 

AGATGGTCAACGACCGGTGGAAGACCATGGGCGGCGCTGCCCAACTTGAGGACCGGCCGC 
GCGACAAGCCGCAGCGGCCGAGCTGCGGCTACGTGCTGTGCACCGTGCTGCTGGCCCTGG 
CTGTGCTGCTGGCTGTAGCTGTCACC 

GGTGCCGTGCTCTTCCTGAACCACGCCCACGCGCCGGGCACGGCGCCCCCACCTGTCGTC 
AGCACTGGGGCTGCCAGCGCCAACAGCGCCCTGGTCACTGTGGAAAGGGCGGACAGCTCG 
CACCTCAGCATCCTCATTGACCCGCGCTGCCGCGACCTCACCGACAGCTTCGCACGCCTG 
GAGAGCGCCCAGGCCTCGGTGCTGCAGGCGCTGACAGAGCACCAGGCCCAGCCACGGCTG 
GTGGGCGACCAGGAGCAGGAGCTGCTGGACACGCTGGCCGACCAGCTGCCCCGGCTGCTG 
GCCCGAGCCTCAGAGCTGCAGACGGAGTGCATGGGGCTGCGGAAGGGGCATGGCACGCTG 
GGCCAGGGCCTCAGCGCCCTGCAGAGTGAGCAGGGCCGCCTCATCCAGCTTCTCTCTGAG 
AGCCAGGGCCACATGGCTCACCTGGTGAACTCCGTCAGCGACATCCTGGATGCCCTGCAG 
AGGGACCGGGGGCTGGGCCGGCCCCGCAACAAGGCCGACCTTCAGAGAGCGCCTGCCCGG 
GGAACCCGGCCCCGGGGCTGTGCCACTGGCTCCCGGCCCCGAGACTGTCTGGACGTCCTC 
CTAAGCGGACAGCAGGACGATGGCGTCTACTCTGTCTTTCCCACCCACTACCCGGCCGGC 
TTCCAGGTGTACTGTGACATGCGCACGGACGGCGGCGGCTGGACGGTGTTTCAGCGCCGG 
GAGGACGGCTCCGTGAACTTCTTCCGGGGCTGGGACGCGTACCGAGACGGCTTTGGCAGG 
CTCACCGGGGAGCACTGGCTAGGGCTCAAGAGGATCCACGCCCTGACCACACAGGCTGCC 
TACGAGCTGCACGTGGACCTGGAGGACTTTGAGAATGGCACGGCCTATGCCCGCTACGGG 
AGCTTCGGCGTGGGCTTGTTCTCCGTGGACCCTGAGGAAGACGGGTACCCGCTCACCGTG 
GCTGACTATTCCGGCACTGCAGGCGACTCCCTCCTGAAGCACAGCGGCATGAGGTTCACC 
ACCAAGGACCGTGACAGCGACCATTCAGAGAACAACTGTGCCGCCTTCTACCGCGGTGCC 
TGGTGGTACCGCAACTGCCACACGTCCAACCTCAATGGGCAGTACCTGCGCGGTGCGCAC 
GCCTCCTATGCCGACGGCGTGGAGTGGTCCTCCTGGACCGGCTGGCAGTACTCACTCAAG 
TTCTCTGAGATGAAGATCCGGCCGGTCCGGGAGGACCGCTAGACTGGTGCACCTTGTCCT 
TGGCCCTGCTGGTCCCTGTCGCCCCATCCCCGACCCCACCTCACTCTTTCGTGAATGTTC 
TCCACCCACCTGTGCCTGGCGGACCCACTCTCCAGTAGGGAGGGGCCGGGCCATCCCTGA 
CACGAAGCTCCCTGGGCCGGTGAAGTCACACATCGCCTTCTCGCCGTCCCCACCCCCTCC 
ATTTGGCAGCTCACTGATCTCTTGCCTCTGCTGATGGGGGCTGGCAAACTTGACGACCCC 
AACTCCTGCCTGCCCCCACTGTGACTCCGGTGCTGTTTGCCGTCCCCTGGCCAGGATGGT 
GGAGTCTGCCCCAGGCACCCTCTGCCCTGCCCGGCCAAATACCCGGCATTATGGGGACAG 
AGAGCAGGGGGCAGACAGCACCCCTGGAGTCCTCCTAGCAGATCGTGGGGAATGTCAGGT 
CTCTCTGAGGTCAGGTCTGAGGCCAGTATCCTCCAGCCCTCCCAATGCCAACCCCCACCC 
CGTTTCCCTGGTGCCCAGAGAACCCACCTCTCCCCCAA 

GGGCCTCAGCCTGGCTGTGGGCTGGGTGGCCCCATCCTACCAGGCCCTGAGGTCAGGATG 
GGGAGCTGCTGCCTTTGGGGACCCACGCTCCAAGGCTGAGACCAGTTCCCTGGAGGCCAC 
CCACCCTGTGCCCCGGCAGGCCTGGGGTCTGCAGTCCTCTTACCTGCTGTGCCCACCTGC 
TCTCTGTCTCAAATGAGGCCCAACCCATCCCCCACCCAGCTCCCGGCCGTCCTCCTACCT 
GGGGCAGCCGGGGCTGCCATCCCATTTCTCCTGCCTCTGGAAGGTGGGTGGGGCCCTGCA 
CCGTGGGGCTGGACTGCGCTAATGGGAAGCTCTTGGTTTTCTGGGCTGGGGCCTAGGCAG 
GGCTGGGATGAGGCTTGTACAACCCCCACCACCAATTTCCCAGGGACTCCAGGGTCCTGA 
GGCCTCCCAGGAGGGCCTTGGGGGTGATGACCCCTTCCCTGAGGTGGCTGTCTCCATGAG 
GAGGCCAACCCTTGCCATTGACCGTGGCCACCTGGACCCAGGCCAGGCCCGGCCCGGCGA 
GTGGTCAAGGGACAGGGACCACCTCACCGGGCAAATGGGGTCGGGGGGACTGGGGCACCA 
GACCAGGCACCACCTGGACACTTTCTTGTTGAATCCTCCCAACACCCAGCACGCTGTCAT 
CCCCACTCCTTGTGTGCACACATGCAGAGGTGAGACCCGCAGGCTCCCAGGACCAGCAGC 
CACAAGGGCAGGGCTGGAGCCGGGTCCTCAGCTGTCTGCTCAGCAGCCCTGGACCCGCGT 
GCGTTACGTCAGGCCCAGATGCAGGGCGGCTTTTCCAAGGCCTCCTGATGGGGGCCTCCG 
AAAGGGCTGGAGTCAGCCTTGGGGAGCTGCCTAGCAGCCTCTCCTCGGGCAGGAGGGGAG 
GTGGCTTCCTCCAAAGGACACCCGATGGCAGGTGCCTAGGGGGTGTGGGGTTCCGTTCTC 
CCTTCCCCTCCCACTGAAGTTTGTGCTTAAAAAACAATAAATTTGACTTGGCACCACTGG 
GGGTTGGTGGGAGAGGCCGTGTGACCTGGCTCTCTGTCCCAGTGCCACCAGGTCATCCAC 
ATGCGCAG 
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Figure 62 

MVNDRWKTMGGAAQLEDRPRDKPQRPSCGYVLCTV^ 

GTAP PPWSTGAASANSALVTVERADS SHLS I LI DPRCPDLTDS FARLESAQASVLQALT 
EHQAQPRLVGDQEQELLDTLADQLPRLLARASELQTECMGLRKGHGTLGQGLSALQSEQG 
RL I QLLS ESQGHMAHLVNS VSD I LDALQRDRGLGRPRNKADLQRAPARGTRPRGCATGS R 
PRDCLDVLLSGQQDDGVYSVFPTHYPAGFQVYCDMRTDGGGWTVFQRREDGSVNFFRGWD 
AYRDGFGRLTGEHWLGLKRIHALTTQAAYELHVDLEDFENGTAYARYGSFGVGLFSVDPE 
EDGYPLTVADYSGTAGDSLLKHSGMRFTTKDRDSDHSENNCAAFYRGAWWYRNCHTSNLN 
GQ YLRGAHAS YADGVEWS S WTGWQ YSLKF S EMKI RPVREDR 
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Figure 63 

AGTGACTGCAGCCTTCCTAGATCCCCTCCACTCGGTTTCTCTCTTTGCAGGAGCACCGGC 
AGCACCAGTGTGTGAGGGGAGCAGGCAGCGGTCCTAGCCAGTTCCTTGATCCTGCCAGAC 
CACCCAGCCCCCGGCACAGAGCTGCTCCACAGGCACC 

ATGA GGATCATGCTGCTATTCACAGCCATCCTGGC CTTCAG C CTAGCTCAGAG CTTTGGG 
GCTGTCTGTAAGGAGCCACAGGAGGAGGTGGTTCCTGGCGGGGGCCGCAGCAAGAGGGAT 
CCAGATCTCTACCAGCTGCTCCAGAGACTCTTCAAAAGCCACTCATCTCTGGAGGGATTG 
CTCAAAGCCCTGAGCCAGGCTAGCACAGATCCTAAGGAATCAACATCTCCCGAGAAACGT 
GACATGCATGACTTCTTTGTGGGACTTATGGGCAAGAGGAGCGTCCAGCCAGAGGGAAAG 
ACAGGACCTTTCTTACCTTCAGTGAGGGTTCCTCGGCCCCTTCATCCCAATCAGCTTGGA 
TCCACAGGAAAGTCTTCCCTGGGAACAGAGGAGCAGAGACCTTTATAAGACTCTCCTACG 
GATGTGAATCAAGAGAACGTCCCCAGCTTTGGCATCCTCAAGTATCCCCCGAGAGCAGAA 
TAGGTACTCCACTTCCGGACTCCTGGACTGCATTAGGAAGACCTCTTTCCCTGTCCCAAT 
CCCCAGGTGCGCACGCTCCTGTTACCCTTTCTCTTCCCTGTTCTTGTAACATTCTTGTGC 
TTTGACTCCTTCTCCATCTTTTCTACCTGACCCTGGTGTGGAAACTGCATAGTGAATATC 
CCCAACCCCAATGGGCATTGACTGTAGAATACCCTAGAGTTCCTGTAGTGTCCTACATTA 
AAAATATAATGTCTCTCTCTATTCCTCAACAATAAAGGATTTTTGCATATGAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Figure 64 

MRIMLLFTAILAFSLAQSFGAVCKEPQEEWPGGGRSKRDPDLYQLLQRLFKSHSSLEGL 
LKALSQASTDPKESTSPEKRDMHDFFVGLMGKRSVQPEGKTGPFLPSVRVPRPLHPNQLG 
STGKSSLGTEEQRPL 
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Figure 65 

CTGACATGGCCTGACTCGGGACAGCTCAGAGCAGGGCAGAACTGGGGACACTCTGGGCCG 
GCCTTCTGCCTGC 

ATGGACGCTCTGAAGCCACCCTGTCTCTGGAGGAACCACGAGCGAGGGAAGAAGGACAGG 
GACTCGTGTGGCAGGAAGAACTCAGAGCCGGGAAGCCCCCATTCACTAGAAGCACTGAGA 
GATGCGGCCCCCTCGCAGGGT 

CTGAATTTCCTGCTGCTGTTCACAAAGATGCTTTTTATCTTTAACTTTTTGTTTTCCCCA 
CTTCCGACCCCGGCGTTGATCTGCATCCTGACATTTGGAGCTGCCATCTTCTTGTGGCTG 
ATCACCAGACCTCAACCCGTCTTACCTCTTCTTGACCTGAACAATCAGTCTGTGGGAATT 
GAGGGAGGAGCACGGAAGGGGGTTTCCCAGAAGAACAATGACCTAACAAGTTGCTGCTTC 
TCAGATGCCAAGACTATGTATGAGGTTTTCCAAAGAGGACTCGCTGTGTCTGACAATGGG 
CCCT.GCTTGGGATATAGAAAACCAAACCAGCCCTACAGATGGCTATCTTACAAACAGGTG 
TCTGATAGAGCAGAGTACCTGGGTTCCTGTCTCTTGCATAAAGGTTATAAATCATCACCA 
GACCAGTTTGTCGGCATCTTTGCTCAGAATAGGCCAGAGTGGATCATCTCCGAATTGGCT 
TGTTACACGTACTCTATGGTAGCTGTACCTCTGTATGACACCTTGGGACCAGAAGCCATC 
GTACATATTGTCAACAAGGCTGATATCGCCATGGTGATCTGTGACACACCCCAAAAGGCA 
TTGGTGCTGATAGGGAATGTAGAGAAAGGCTTCACCCCGAGCCTGAAGGTGATCATCCTT 
ATGGACCCCTTTGATGATGACCTGAAGCAAAGAGGGGAGAAGAGTGGAATTGAGATCTTA 
TCCCTATATGATGCTGAGAACCTAGGCAAAGAGCACTTCAGAAAACCTGTGCCTCCTAGC 
CCAGAAGACCTGAGCGTCATCTGCTTCACCAGTGGGACCACAGGTGACCCCAAAGGAGCC 
ATGATAACCCATCAAAATATTGTTTCAAATGCTGCTGCCTTTCTCAAATGTGTGGAGCAT 
GCTTATGAGCCCACTCCTGATGATGTGGCCATATCCTACCTCCCTCTGGCTCATATGTTT 
GAGAGGATTGTACAGGCTGTTGTGTACAGCTGTGGAGCCAGAGTTGGATTCTTCCAAGGG 
GATATTCGGTTGCTGGCTGACGACATGAAGACTTTGAAGCCCACATTGTTTCCCGCGGTG 
CCTCGACTCCTTAACAGGATCTACGATAAGGTACAAAATGAGGCCAAGACACCCTTGAAG 
AAGTTCTTGTTGAAGCTGGCTGTTTCCAGTAAATTCAAAGAGCTTCAAAAGGGTATCATC 
AGGCATGATAGTTTCTGGGACAAGCTCATCTTTGCAAAGATCCAGGACAGCCTGGGCGGA 
AGGGTTCGTGTAATTGTCACTGGAGCTGCCCCCATGTCCACTTCAGTCATGACATTCTTC 
CGGGCAGCAATGGGATGTCAGGTGTATGAAGCTTATGGTCAAACAGAATGCACAGGTGGC 
TGTACATTTACATTACCTGGGGACTGGACATCAGGTCACGTTGGGGTGCCCCTGGCTTGC 
AATTACGTGAAGCTGGAAGATGTGGCTGACATGAACTACTTTACAGTGAATAATGAAGGA 
GAGGTCTGCATCAAGGGTACAAACGTGTTCAAAGGATACCTGAAGGACCCTGAGAAGACA 
CAGGAAGCCCTGGACAGTGATGGCTGGC 

TTCACACAGGAGACATTGGTCGCTGGCTCCCGAATGGAACTCTGAAGATCATCGACCGTA 
AAAAGAACATTTTCAAGCTGGCCCAAGGAGAATACATTGCACCAGAGAAGATAGAAAATA 
TCTAC^^CAGGAGTCAACCAGTGTTACAAATTTTTGTACACGGGGAGAGCTTACGGTCAT 
CCTTAGTAGGAGTGGTGGTTCCTGACACAGATGTACTTCCCTCATTTGCAGCCAAGCTTG 
GGGTGAAGGGCTCCTTTGAGGAACTGTGCCAAAACCAAGTTGTAAGGGAAGCCATTTTAG 
AAGACTTGCAGAAAATTGGGAAAGAAAGTGGCCTTAAAACTTTTGAACAGGTCAAAGCCA 
TTTTTCTTCATCCAGAGCCATTTTCCATTGAAAATGGGCTCTTGACACCAACATTGAAAG 
CAAAGCGAGGAGAGCTTTCCAAATACTTTCGGACCCAAATTGACAGCCTGTATGAGCACA 
' TCCAGGATTAGGATAAGGTACTTAAGTACCTGCCGGCCCACTGTGCACTGCTTGTGAGAA 
AATGGATTAAAAACTATTCTTACATTTGTTTTGCCTTTCCTCCTATTTTTTTTTAACCTG 
TTAAACTCTAAAGCCATAGCTTTTGTTTTATATTGAGACATATAATGTGTAAACTTAGTT 
CCCAAATAAATCAATCCTGTCTTTCCCATCTTCGATGTTGCTAATATTAAGGCTTCAGGG 
CTACTTTTATCAACATGCCTGTCTTCAAGATCCCAGTTTATGTTCTGTGTCCTTCCTCAT 
GATTTCCAACCTTAATACTATTAGTAACCACilAGTTCAAGGGTCAAAGGGACCCTCTGTG 
CCTTCTTCTTTGTTTTGTGATAAACATAACTTGCCAACAGTCTCTATGCTTATTTACATC 
TTCTACTGTTCAAACTAAGAGATTTTTAAATTCTGAAAAACTGCTTACAATTCATGTTTT 
CTAGCCACTCCACAAAC CACTAAAATTTTAGTTTTAGC CTATCACTCATGTCAATCATAT 
CTATGAGACAAATGTCTCCGATGCTCTTCTGCGTAAATTAAATTGTGTACTGAAGGGAAA 
AGTTTGATCATACCAAACATTTCCTAAACTCTCTAGTTAGATATCTGACTTGGGAGTATT 
AAAAATTGGGTCTATGACATACTGTCCAAAAGGAATGCTGTTCTTAAAGCATTATTTACA- 
GTAGGAACTGGGGAGTAAATCTGTTCCGTACAGTTTGCTGCTGAGCTGGAAGCTGTGGGG 
GAAGGAGTTGACAGGTGGGCCCAGTGAACTTTTCCAGTAAATGAAGCAAGCACTGAATAA 
AAACCTCCTGAACTGGGAACAAAGATCTACAGGC^lAGCAAGATGCCCACACAACAGGCTT 
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Figure 66 

MDALKPPCLWRNHERGKKDRDSCGRKNSEPGSPHSLEALRDAAPSQGLNFLLLFTKMLFI 
FNFLFSPLPTPALI CILTFGAAI FLWLITRPQPVLPLLDLNNQSVGI EGGARKGVSQKNN 
DLTSCCFSDAKTMYEVFQRGLAVSDNGPCLGYRKPNQPYRWLSYKQVSDRAEYLGSCLLH 
KGYKSSPDQFVGI FAQNRPEWI I SELACYTYSMVAVPLYDTLGPEAI VHI VNKADIAMVI 
CDTPQKALVLIGNVEKGFTPSLKVIILMDPFDDDLKQRGEKSGIEILSLYDAENLGKEHF 
RKPVPPSPEDLSVICFTSGTTGDPKGAMITHQNIVSNAAAFLKCVEHAYEPTPDDVAISY 
LPLAHMFER I VQAWYS CGAR VGFFQGDI RLLADDMKTLKPTLF PAVP RLLNR I YDKVQN 
EAKTPLKKFLLKLAVSSKFKELQKGIIRHDSFWDKLIFAKIQDSLGGRVRVIVTGAAPMS 
TSVMTFFRAAMGCQVYEAYGQTECTGGCTFTLPGDWTSGHVGVPl^CWYVKLEDVADMNY 
FTVNNEGEVCIKGTNVFKGYLKDPEKTQEALDSDGWLHTGDIGRWLPNGTLKIIDRKKNI 
FKLAQGEYIAPEKIENIYNRSQPVLQIFVHGESLRSSLVGWVPDTDVLPSFAAKLGVKG 
SFEELCQNQWREAILEDLQKIGKESGLKTFEQVKAIFLHPEPFSIENGLLTPTLKAKRG 
ELSKYFRTQIDSLYEHIQD 



68 / 133 



WO 00/53758 



PCT/US00/O5841 



Figure 67 

GAAAGA 

ATG TTGTGGCTGCTCTTTTTTCTGGTGACTGCCATTGATGCT 

GAACTCTGTCAACCAGGTGCAGAAAATGCTTTTAAAGTGAGACTtAGTATCAGAACAGCT 
CTGGGAGATAAAG CATATG CCTGGGATACCAATGAAGAATACCTCTTCAAAG CGATGGTA 
GCTTTCTCCATGAGAAAAGTTCCCAACAGAGAAGCAACAGAAATTTCCCATGTCCTACTT 
TGCAATGTAACCCAGAGGGTATCATTCTGGTTTGTGGTTACAGACCCTTCAAAAAATCAC 
ACCCTTCCTG CTGTTGAGGTG CAATCAGCCATAAGAATGAACAAGAAC CGGATCAACAAT 
GCCTTCTTTCTAAATGACCAAACTCTGGAATTTTTAAAAATCCCTTCCACACTTGCACCA 
CCCATGGACCCATCTGTG CC CATCTGGATTATTATATTTGGTGTGATATTTTGCATCATC 
ATAGTTGCAATTGCACTACTGATTTTATCAGGGATCTGGCAACGTAGAAGAAAGAACAAA 
GAACCATCTGAAGTGGATGACGCTGAAGATAAGTGTGAAAACATGATCACAATTGAAAAT 
GGCATCCCCTCTGATCCCCTGGACATGAAGGGGGGCATATTAATGATGCCTTCATGACAG 
AGGATGAGAGGCTCACCCCTCTCTGAAGGGCTGTTGTTCTGCTTCCTCAAGAAATTAAAC 
ATTTGTTTCTGTGTGACTGCTGAGCATCCTGAAATACCAAGAGCAGATCATATATTTTGT 
TTCACCATTCTTCTTTtGTAATAAATTTTGAATGTGCTTGAJ^GTGAAAAGCAATCAATT 
ATACCCACCAACACCACTGAAATCATAAGCTATTCACGACTCAAAATATTCTAAAATATT 
TTTCTGACAGTATAGTG 

TATAAATGTGGTCATGTGGTATTTGTAGTTATTGATTTAAGCATTTTTAGAAATAAGATC 
AGGCATATGTATATATTTTCACACTTCAAAGACCTAAGGAAAAATAAATTTTCCAGTGGA 
GAATACATATAATATGGTGTAGAAATCATTGAAAATGGATCCTTTTTGACGATCACTTAT 
ATCACTCTGTATATGACTAAGTAAACAAAAGTGAGAAGTAATTATTGTAAATGGATGGAT 
AAAAATGGAATTACTCATATACAGGGTGGAATTTTATCCTGTTATCACACCAACAGTTGA 
TTATATATTTTCTGAATATCAGCCCCTAATAGGACAATTCTATTTGTTGACCATTTCTAC 
AATTTGTAAAAGTCCMTCTGTGCTAACTTAATAAAGTAATAATCATCTCTTTTTAAAAA 
AAAAAAAAAAAAAAAAAAAAA 
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Figure 68 

MLWLLFFLVTAIHAELCQPGAENAFKVRLSIRTALGDKAYAWDTNEEYLFKAMVAFSMRK 
VPNREATE I S HVLL CNVTQR VS FWFWTD P S KNHTL PAVE VQS AI RMNKNR I NNAFFLND 
QTLEFLKI PSTLAPPMDPSVPIWI I IFGVI FCII IVAIALLILSGIWQRRRKNKEPSEVD 
DAEDKCENMITIENGIPSDPLDMKGGILMMPS 
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Figure 69 

AACTCAAACTCCTCTCTCTGGGAAAACGCGGTGCTTGCTCCTCCCGGAGTGGCCTTGGCA 
GGGTGTTGGAGCCCTCGGTCTGCCCCGTCCGGTCTCTGGGGCCAAGGCTGGGTTTCCCTC 
ATG TATGGCAAGAGCTCTACTCGTGCGGTGCTTCTTCTCCTTGGCATACAGCTCACAGCT 
CTTTGGCCTATAGCAGCTGTGGAAATTTATACCTCCCGGGTGCTGGAGGCTGTTAATGGG 
ACAGATGCTCGGTTAAAATGCACTTTCTCCAGCTTTGCCCCTGTGGGTGATGCTCTAACA 
GTGACCTGGAATTTTCGTCCTCTAGACGGGGGACCTGAGCAGTTTGTATTCTACTACCAC 
ATAGATCCCTTCCAACCCATGAGTGGGCGGTTTAAGGACCGGGTGTCTTGGGATGGGAAT 
CCTGAGCGGTACGATGCCTCCATCCTTCTCTGGAAACTGCAGTTCGACGACAATGGGACA 
TACACCTGCCAGGTGAAGAACCCACCTGATGTTGATGGGGTGATAGGGGAGATCCGGCTC 
AGCGTCGTGCACACTGTACGCTTCTCTGAGATCCACTTCCTGGCTCTGGCCATTGGCTCT 
GCCTGTGCACTGATGATCATAATAGTAATTGTAGTGGTCCTCTTCCAGCATTACCGGAAA 
AAGCGATGGG C CGAAAGAG CTCATAAAGTGGTGGAGATAAAATCAAAAGAAGAGGAAAGG 
CTCAACCAAGAGAAAAAGGTCTCTGTTTATTTAGAAGACACAGAC TAA CAATTTTAGATG 
GAAGCTGAGATGATTTCCAAGAACAAGAACCCTAGTATTTCTTGAAGTTAATGGAAACTT 
TTCTTTGGCTTTTCCAGTTGTGACCCGTTTTCC7VACCAGTTCTGCAGCATATTAGATTCT 
AGACAAGCAACACCCCTCTGGAGCCAGCACAGTGCTCCTCCATATCACCAGTCATACACA 
GCCTCATTATTAAGGTCTTATTTAATTTCAGAGTGTAA 

ATTTTTTCAAGTGCTCATTAGGTTTTATAAACAAGAAGCTACATTTTTGCCCTTAAGACA 
CTACTTACAGTGTTATGACTTGTATACACATATATTGGTATCAAAGGGGATAAAAGCCAA 
TTTGTCTGTTACATTTCCTTTCACGTATTTCTTTTAGCAGCACTTCTGCTACTAAAGTTA 
ATGTGTTTACTCTCTTTCCTTCCCACATTCTCAATTAAAAGGTGAGCTAAGCCTCCTCGG 
TGTTTCTGATTAACAGTAAATCCTAAATTCAAACTGTTAAATGACATTTTTATTTTTATG 
TCTCTCCTTAACTATGAGACACATCTTGTTTTACTGAATTTCTTTCAATATTCCAGGTGA 
TAGATTTTTGTCG 
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Figure 70 

MYGKSSTRAVLLLLGIQLTALWPIAAVEIYTSRVLEAVNGTDARLKCTFSSFAPVGDALT 
VTWNFRPLIX3GPEQFVFYYHIDPFQPMSGRFKDRVSWDGNPERYDASILLWKLQFDDNGT 
YTCQVKNPPDVDGVIGEIRLSVVHTVRFSEIHFLALAIGSACALMI I IVIVWLFQHYRK 
KRWAERAHKWEIKSKEEERLNQEKKVSVYLEDTD 
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Figure 7 1 

AAGCAACCAAACTGCAAGCTTTGGGAGTTGTTCGCTGTCCCTGCCCTGCTCTGCTAGGGA 
GAGAACGCCAGAGGGAGGCGGCTGGCCCGGCGGCAGGCTCTCAGAACCGCTACCGGCG 
ATGCTACTGCTGTGGGTGTCGGTGGTCGCAGCCTTGGCGCTGGCGGTACTGGCCCCCGGA 
GCAGGGGAGCAGAGGCGGAGAGCAGCCAAAGCGCCCAATGTGGTG 

CTGGTCGTGAGCGACTCCTTCGATGGAAGGTTAACATTTCATCCAGGAAGTCAGGTAGTG 
AAACTTCCTTTTATCAACTTTATGAAGACACGTGGGACTT^ 

AACTCTCCAATTTGTTGCCCATCACGCGCAGCAATGTGGAGTGGCCTCTTCACTCACTTA- 
ACAGAATCTTGGAATAATTTTAAGGGTCTAGATCCAAATTATACAACATGGATGGATGTC 
ATGGAGAGGCATGGCTACCGAACACAGAAATTTGGGAAACTGGACTATACTTCAGGACAT 
CACTCCATTAGTAATCGTGTGGAAGCGTGGACAAGAGATGTTGCTTTCTTACTCAGACAA 
GAAGGCAGGCCCATGGTTAATCTTATCCGTAACAGGACTAAAGTCAGAGTGATGGAAAGG 
GATTGGCAGAATACAGACAAAGCAGTAAACTGGTTAAGAAAGGAAGCAATTAATTACACT 
GAACCATTTGTTATTTACTTGGGATTAAATTTACCACACCCTTACCCTTCACCATCTTCT 
GGAGAAAATTTTGGATCTTCAACATTTCACACATCTCTTTATTGGCTTGAAAAAGTGTCT 
CATGATGCCATCAAAATCCCAAAGTGGTCACCTTTGTCAGAAATGCACCCTGTAGATTAT 
TACTCTTCTTATACAAAAAACTGCACTGGAAGATTTACAAAAAAAGAAATTAAGAATATT 
AGAGCATTTTATTATGCTATGTGTGCTGAGACAGATGCCATGCTTGGTGAAATTATTTTG 
GCCCTTCATCAATTAGATCTTCTTCAGAAAACTATTGT CATATACTCCTCAGAC CATGGA 
GAGCTGGCCATGGAACATCGACAGTTTTATAAAATGAGCATGTACGAGGCTAGTGCACAT 
GTTCCGCTTTTGATGATGGGACCAGGAATTAAAGCCGGCCTACAAGTATCAAATGTGGTT 
TCTCTTGTGGATATTTACCCTACCATGCTTGATATTGCTGGAATTCCTCTGCCTCAGAAC 
CTGAGTGGATACTCTTTGTTGCCGTTATCATCAGAAACATTTAAGAATGAACATAAAGTC 
AAAAACCTGCATCCACCCTGGATTCTGAGTGAATTCCATGGATGTAATGTGAATGCCTCC 
ACCTACATGCTTCGAACTAACCACTGGAAATATATAGCCTATTCGGATGGTGCATCAATA 
TTGCCTCAACTCTTTGATCTTTCCTCGGATCCAGATGAATTAACAAATGTTGCTGTAAAA 
TTTCCAGAAATTACTTATTCTTTGGATCAGAAGCTTCATTC 

GTTTCTGCTTCTGTCCACCAGTATAATAAAGAGCAGTTTATCAAGTGGAAACAAAGTATA 
GGACAGAATTATTCAAACGTTATAGCAAATCTTAGGTGGCACCAAGACTGGCAGAAGGAA 
CCAAGGAAGTATGAAAATGCAATTGATCAGTGGCTTAAAACCCATATGAATCCAAGAGCA 
GTTTGAACAAAAAGTTTAAAAATAGTGTTCTAGAGATACATATAAATATATTACAAGATC 
ATAATTATGTATTTTAAATGAAACAGTTTTAATAATTACCAAGTTTTGGCCGGGCACAGT 
GGCTCACACCTGTAATCCCAGGACTTTGGGAGGCTGAGGAAAGCAGATCACAAGGTCAAG 
AGATTGAGACCATCCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATT 
AGCTGGGCGCGGTGGTGCACACCTATAGTCTCAGCTACTCAGAGGCTGAGGCAGGAGGAT 
CGCTTGAACCCGGGAGGCAGCAGTTGCAGTGAGCTGAGATTGCGCCACTGTACTCCAGCC 
TGGCAACAGAGTGAGACTGTGTCGCAAAAAAATAAAAATAAAATAATAATAATTACCAAT 
TTTTCATTATTTTGTAAGAATGTAGTGTATTTTAAGATAAAATGCCAATGATTATAAAAT 
CACATATTTTCAAAAATGGTTATTATTTAGGCCTTTGTACAATTTCTAACAATTTAGTGG 
AAGTATCAAAAGGATTGAAGCAAATACTGTAACAGTTATGTTCCTTTAAATAATAGAGAA 
TATAAAATATTGTAATAATATGTATCATAAAATAGTTGTATGTGAGCATTTGATGGTGAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Figure 72 

MLLLWVSWAALALAVIAPGAGEQRRRA^^ 

NFMieTRGTSFLNAYTNSPICCPSRAAMWSGLFTHLTESWNNFKGLDP^TTWMDVMERH^ 
YRTQKFGKLDYTSGHHSISNRVEAWTRDVAFLLRQEGRPMV^ 

DKAVNWLRKEAINYTEPFVIYLGLNLPHPYPSPSSGENFGSSTFHTSLYWLEKVSHDAIK 
I PKWSPLSEMHPVDYYSSYTKNCTGRFTKKEIKNIRAFYYAMCAETDAMLGEI ILALHQL 
DLLQKTIVIYSSDHGELAMEHRQFYKMSMYEASAHVPLLMMGPGIKAGLQVSNWSLVDI 
YPTMLDIAGIPLPQNLSGYSLLPLSSETFKNEHKVKNLHPPWILSEFHGCNVNASTYMLR 
TNHWKYIAYSDGASILPQLFDLSSDPDELTNVAVKFPEITYSLDQKLHSIINYPKVSASV 
HQYNKEQFIKWKQSIGQNYSNVIANLRWHQDWQKEPRKYENAIDQWLKTHMNPRAV 
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Figure 73 

GACGCCCAGTGACCTGCCGAGGTCGGCAGCACAGAGCTCTGGAG 

ATGAAGACCCTGTTCCTGGGTGTCACGCTCGGCCTGGCCGCTGCCCTGTCCTTCACCCTG 
GAGGAGGAGGATATCACAGGGACCTGGTACGTGAAGGCCATGGTGGTCGATAAGGACTTT 
CCGGAGGACAGGAGGCCCAGGAAGGTGTCCCCAGTGAAGGTGACAGCCCTGGGCGGTGGG 
AAGTTGGAAGCCACGTTCACCTTCATGAGGGAGGATCGGTGCATCCAGAAGAAAATCCTG 
ATGCGGAAGACGGAGGAGCCTGGCAAATACAGCGCCTATGGGGGCAGGAAGCTCATGTAC 
CTGCAGGAGCTGCCCAGGAGGGACCACTACATCTTTTACTGCAAAGACCAGCACCATGGG 
GGCCTGCTCCACATGGGAAAGCTTGTGGGTAGGAATTCTGATACCAACCGGGAGGCCCTG 
GAAGAATTTAAGAAATTGGTGCAGCGCAAGGGACTCTCGGAGGAGGACATTTTCACGCCC 
CTGCAGACGGGAAGCTGCGTTCCCGAACACTAGGCAGCCCCCGGGTCTGCACCTCCAGAG 
CCCACCCTACCACCAGACACAGAGCCCGGACCACCTGGACCTACCCTCCAGCCATGACCC 
TTCCCTGCTCCCACCCACCTGACTCCAAATAAAGTCCTTTTCCCCCAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Figure 74 
MKTLFLGVTLGLAAALSFTLEEEDITGTWYV^ 

KLEATFTFMREDRCIQKKILMRKTEEPGKYSAYGGRKLMYLQELPRRDHYIFYCKDQHHG 
GLLHMGKLVGRNSDTNREALEEFKKLVQRKGLSEEDIFTPLQTGSCVPEH 
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Figure 75 

CTGGGATCAGCCACTGCAGCTCCCTGAGCACTCTCTACAGAGACGCGGACCCCAGAC 

ATGA GGAGGCTCCTCCTGGTCACCAGCCTGGTGGTTGTGCTGCTGTGGGAGGCAGGTGCA 

GTCCCAGCACCCAAGGTCCCTATCAAGATGCAAGTCAAACACTGGCCCTCAGAGCAGGAC 

CCAGAGAAGGCCTGGGGCGCCCGTGTGGTGGAGCCTCCGGAGAAGGACGACCAGCTGGTG 

GTGCTGTTCCCTGTCCAGAAGCCGAAACTCTTGACCACCGAGGAGAAGCCACGAGGTCAG 

GGCAGGGGCCCCATCCTTCCAGGCACCAAGGCCTGGATGGAGACCGAGGACACCCTGGGC 

CGTGTCCTGAGTCCCGAGCCCGACCATGACAGCCTGTACCACCCTCCGCCTGAGGAGGAC 

CAGGGCGAGGAGAGGCCCCGGTTGTGGGTGATGCCAAATCACCAGGTGCTCCTGGGACCG 

GAGGAAGACCAAGAC CACAT CTACCAC C CC CAGTAGGGCTCCAGGGGCCATCACTGCCCC 

CGCCCTGTCCCAAGGCCCAGGCTGTTGGGACTGGGACCCTCCCTACCCTGCCCCAGCTAG 

ACAAATAAACCC CAG CAGGCAAAAAAAAAAAAAAAAAAA 
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Figure 76 

MRRLLLVTS L WVLLWEAGAVPAP KVP I KMQ VKHW P S EQD P EKAWGARWE P PEKDDQL V 
VLFPVQKPKLLTTEEKPRGQGRGPILPGTKAWMETEDTLGRVLSPEPDHDSLYHPPPEED 
QGEERPRLWVMPNHQVLLGPEEDQDHIYHPQ 
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Figure 77 

GGGAGAGAGGATAAATAGCAGCGTGGCTTCCCTGGCTCCTCTCTGCATCCTTCCCGACCT 
TCCCAGCAATATGCATCTTGCACGTCTGGTCGGCTCCTGCTCCCTCCTTCTGCTACTGGG 
GGCCCTGTCTGG 

ATGG GCGGCCAGCGATGACCCCATTGAGAAGGTCATTGAAGGGATCAACCGAGGGCTGAG 
CAATGCAGAGAGAGAGGTGGGCAAGGCCCTGGATGGCATCAACAGTGGAATCACGCATGC 
CGGAAGGGAAGTGGAGAAGGTTTTCAACGGACTTAGCAACATGGGGAGCCACACCGGCAA 
GGAGTTGGACAAAGGCGTCCAGGiGGCTCAACCACGGCATGGACAAGGTTGCCCATGAGAT 
CAACCATGGTATTGGACAAGCAGGAAAGGAAGCAGAGAAGCTTGGCCATGGGGTCAACAA 
CGCTGCTGGACAGGCCGGGAAGGAAGCAGACAAAGCGGTCCAAGGGTTCCACACTGGGGT 
CCACCAGGCTGGGAAGGAAGCAGAGAAACTTGGCCAAGGGGTCAACCATGCTGCTGACCA 
GGCTGGAAAGGAAGTGGAGAAGCTTGGCCAAGGTGCCCACCATGCTGCTGGCCAGGCCGG 
GAAGGAGCTGCAGAATGCTCATAATGGGGTCAACCAAGCCAGCAAGGAGGCCAACCAGCT 
GCTGAATGGCAACCATCAAAGCGGATCTTCCAGCCATCAAGGAGGGGCCACAACCACGCC 
GTTAGCCTCTGGGGCCTCAGTCAACACGCCTTTCATCAACCTTCCCGCCCTGTGGAGGAG 
CGTCGCCAACATCATGCCC TAAA CTGGCATCCGGCCTTGCTGGGAGAATAATGTCGCCGT 
TGTCACATCAGCTGACATGACCTGGAGGGGTTGGGGGTGGGGGACAGGTTTCTGAAATCC 
CTGAAGGGGGTTGTACTGGGATTTGTGAATAAACTTGATACACCA 
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Figure 78 

MHLARLVGSCSLLLLLGALSGWAASDDPIEKVIEGINRGLSNAEREVGKALDGINSGITH 
AGREVEK^FNGLSNMGSHTGKELDKGVQGLNHGMDKVAHEINHGIGQAGKEAEKLGHGVN 
NAAGQAGKEADKAVQG FHTGVHQAGKEAE KLGQGVNHAADQAGKEVE KLGQGAHHAAGQA 
GKELQNAHNGVNQASKEANQLLNGNHQSGSSSHQGGATTTPLASGASVNTPFINLPALWR 
SVANIMP 
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Figure 79 

GGACAACCGTTGCTGGGTGTCCCAGGGCCTGAGGCAGGACGGTACTCCGCTGACACCTTC 
CCTTTCGGCCTTGAGGTTCCCAGCCTGGTGGCCCCAGGACGTTCCGGTCGCATGGCAGAG 
TGCTACGGACGACGCCT 

ATGAAGCCCTTAGTCCTTCTAGTTGCGCTTTTGCTATGGCCTTCGTCTGTGCCGGCTTAT 

CCGAGCATAACTGTGACACCTGATGAAGAGCAAAACTTGAATCATTATATACAAGTTTTA 

GAGAACCTAGTACGAAGTGTTCCCTCTGGGGAGCCAGGTCGTGAGAAAAAATCTAACTCT 

CCAAAACATGTTTATTCTATAGCATCAAAGGGATCAAAATTTAAGGAGCTAGTTACACA 

GGAGACGCTTCAACTGAGAATGATGTTTTAACCAATCCTATCAGTGAAGAAACTACAACT 

TTCCCTACAGGAGGCTTCACACCGGAAATAGGAAAGAAAAAACACACGGAAAGTACCCCA 

TTCTGGTCGATCAAAC CAAACAATGTTTCCATTGTTTTGCATGCAGAGGAAC CTTATATT 

GAAAATGAAGAGCCAGAGCCAGAGCCGGAGCCAGCTGCAAAACAAACTGAGGCACCAAGA 

ATGTTGCCAGTTGTTACTGAATCATCTACAAGTCCATATGTTACCTCATACAAGTCACCT 

GTCACCACTTTAGATAAGAGCACTGGCATTGAGATCTCTACAGAATCAGAAGATGTTCCT 

CAGCTCTCAGGTGAAACTGCGATAGAAAAACCCGAAGAGTTTGGAAAGCACCCAGAGAGT 

JGGAATAATGATGACATTTTGAAAAAAATTTTAGATATTAATTCACAAGTGCAACAGGCA 

CTTCTTAGTGACACCAGCAACCCAGCATATAGAGAAGATATTGAAGCCTCTAAAGATCAC 

CTAAAACGAAGCCTTGCTCTAGCAGCAGCAGCAGAACATAAATTAAAAACAATGTATAAG 

TCCCAGTTATTGCCAGTAGGACGAACAAGTAATAAAATTGATGACATCGAAACTGTTATT 

AACATGCTGTGTAATTCTAGATCTAAACTCTATGAATATTTAGATATTAAATGTGTTCCA 

CCAGAGATGAGAGAAAAAGCTGCTACAGTATTCAATACATTAAAAAATATGTGTAGATCA 

AGGAGAGTCACAGCCTTATTAAAAGTTTAT TAA ACAATAATATAAAAATTTTAAACCTAC 

TTGATATTCCATAACAAAGCTGATTTAAGCAAACTGCATTTTTTCACAGGAGAAATAATC 

ATATTCGTAATTTCAAAAGTTGTATAAAAATATTTTCTATTGTAGTTCAAATGTG C GAAC 

ATCTTTATGTGTCATGTGTTATGAACAATTTTCATATGCACTAAAAACCTAATTTAAAAT 

AAAATTTTGGTTCAGGAAAAAA 



81 / 133 



WO 00/53758 



PCT/US00/05841 



Figure 80 

MKPLVLLVALLLWPSSVPAYPSITVTPDEEQNLNHYIQVLENLVRSVPSGEPGREKKSNS 
PKHVYSIASKGSKFKELVTHGDASTENDVLTNPISEETTTFPTGGFTPEIGKKKHTESTP 
FWSIKPmr^SIVLHAEEPYIENEEPEPEPEPAAKQTEAPRMLPVVTESSTSPYVTSYKSP 
VTTLDKSTGIEISTESEDVPQLSGETAIEKPEEFGKHPESWNNDDILKKILDINSQVQQA 
LLSDTSNPAYREDIEASKDHLKRSIAIiAAAAEHKLKTMYKSQLLPVGRTSNKIDDIETVI 
NMLCNSRSKLYEYLDIKCVPPEMREKAATVFNTLKNMCRSRRVTALLKVY 
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Figure 8 1 

TGCAGCTGTGGGGAGATTTCAGTGCATTGCCTCCCCTGGGTGCTCTTCATCTTGGATTTG 
AAAGTTGAGAGCAGC 

ATG TTTTGCCCACTGAAACTCATCCTGCTGCCAGTGTTACTGGATTATTCCTTGGGCCTG 
AATGACTTGAATGTTTCCCCGCCTGAGCTAACAGTCCATGTGGGTGATTCAGCTCTGATG 
GGATGTGTTTTCCAGAGCACAGAAGACAAATGTATATTCAAGATAGACTGGACTCTGTCA 
CCAGGAGAGCACGCCAAGGACGAATATGTGCTATACTATTACTCCAATCTCAGTGTGCCT 
ATTGGGCGCTTCCAGAACCGCGTACACTTGATGGGGGACATCTTATGCAATGATGGCTCT 
CTCCTGCTCCAAGATGTGCAAGAGGCTGACCAGGGAACCTATATCTGTGAAATCCGCCTC 
AAAGGGGAGAGCCAGGTGTTCAAGAAGGCGGTGGTACTGCATGTGCTTCCAGAGGAGCCC 
AAAGAGCTCATGGTCCATGTGGGTGGATTGATTCAGATGGGATGTGTTTTCCAGAGCACA 
GAAGTGAAACACGTGACCAAGGTAGAATGGATATTTTCAGGACGGCGCGCAAAGGAGGAG 
ATTGTATTTCGTTACTACCACAAACTCAGGATGTCTGTGGAGTACT.CCCAGAGCTGGGGC 
CACTTCCAGAATCGTGTGAACCTGGTGGGGGACATTTTCGGCAATGACGGTTCCATCATG 
CTTCAAGGAGTGAGGGAGTCAGATGGAGGAAACTACACCTGCAGTATCCACCTAGGGAAC 
CTGGTGTTCAAGAAAACCATTGTGCTGCATGTCAGCCCGGAAGAGCCTCGAACACTGGTG 
ACCCCGGCAGCCCTGAGGCCTCTGGTCTTGGGTGGTAA 

TCAGTTGGTGATCATTGTGGGAATTGTCTGTGCCACAATCCTGCTGCTCCCTGTTCTGAT 
ATTGATCGTGAAGAAGACCTGTGGAAATAAGAGTTCAGTGAATTCTACAGTCTTGGTGAA 
GAACACGAAGAAGACTAATCCAGAGATAAAAGAAAAACCCTGCCATTTTGAAAGATGTGA 
AGGGGAGAAACACATTTACTCCCCAATAATTGTACGGGAGGTGATCGAGGAAGAAGAACC 
AAGTGAAAAATCAGAGGCCACCTACATGACCATGCACCCAGTTTGGCCTTCTCTGAGGTC 
AGATCGGAACAACTCACTTGAAAAAAAGTCAGGTGGGGGAATGCCAAAAACACAGCAAGC 
CTTTTGAGAAGAATGGAGAGTCCCTTCATCTCAGCAGCGGTGGAGACTCTCTCCTGTGTG 
TGTCCTGGGCCACTCTACCAGTGATTTCAGACTCCCGCTCTCCCAGCTGTCCTCCTGTCT 
CATTGTTTGGTCAATACACTGAAGATGGAGAATTTGGAGCCTGGCAGAGAGACTGGACAG 
CTCTGGAGGAACAGG CCTGCTGAGGGGAGGGGAG CATGGACTTGGCCTCTGGAGTGGGAC 
ACTGGCCCTGGGAACCAGGCTGAGCTGAGTGGCCTCAAACCCCCCGTTGGATCAGACCCT 
CCTGTGGGCAGGGTTCTTAGTGGATGAGTTACTGGGAAGAATCAGAGATAAAAACCAACC 
CAAATCAA 
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Figure 82 

MFCPLKLILLPVLLDYSLGLNDLNVSPPELTVHVGDSALMGCVFQSTEDKCIFKIDWTLS 
PGEHAKDEYVL YYYSNLS VP I GRFQNR VHLMGD I LCNDGS LLLQDVQEADQGT Y ICEIRL 
KGESQVFKKAVVLHVLPEEPKELMVHVGGLIQMGCWQSTE^^^ 

IVFRYYHKLRMSVEYSQSWGHFQNRVNLVGDIFRNDGSIMLQGVRESDGGNYTCSIHLGN 
LVFKKTIVLHVSPEEPRTLVTPAALRPLVLGGNQLVIIVGIVC^TILLLPVLILIVKKTC 
GNKSSVNSTVLVKNTKKTNPEIKEKPCHFERCEGEKHIYSPIIVREVIEEEEPSEKSEAT 
YMTMHPVWPSLRSDRNNSLEKKSGGGMPKTQQAF 
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Figure 83 

GTCGAAGGTTATAAAAGCTTCCAGCCAAACGGCATTGAAGTTGAAGATACAACCTGACAG 
CACAGCCTGAGATCTTGGGGATCCCTCAGCCTAACACCCACAGACGTCAGCTGGTGGATT 
CCCGCTGCATCAAGGCCTACCCACTGTCTCC 

ATG CTGGGCTCTCCCTGCCTTCTGTGGCTCCTGGCCGTGACCTTCTTGGTTCCCAGAGCT 

CAGCCCTTGGCCCCTCAAGACTTTGAAGAAGAGGAGGCAGATGAGACTGAGACGGCGTGG 

CCGCCTTTGCCGGCTGTCCCCTGCGACTACGACCACTGCCGACACCTGCAGGTGCCCTGC 

AAGGAGCTACAGAGGGTCGGGCCGGCGGCCTGCCTGTGCCCAGGACTCTCCAGCCCCGCC 

CAGCCGCCCGACCCGCCGCGCATGGGAGAAGTGCGCATTGCGGCCGAAGAGGGCCGCGCA 

GTGGTCCACTGGTGTGCCCCCTTCTCCCCGGTCCTCCACTACTGGCTGCTGCTTTGGGAC 

GGCAGCGAGGCTGCGCAGAAGGGGCCCCCGCTGAACGCTACGGTCCGCAGAGCCGAACTG 

AAGGGGCTGAAGCCAGGGGGCATTTATGTCGTTTGCGTAGTGGCCGCTAACGAGGCCGGG 

GCAAGCCGCGTGCCCCAGGCTGGAGGAGAGGGCCTCGAGGGGGCCGACATCCCTGCCTTC 

GGGCCTTGCAGCCGCCTTGCGGTGCCGCCCAACCCCCGCACTCTGGTCCACGCGGCCGTC 

GGGGTGGGCACGGCCCTGGCCCTGCTAAGCTGTGCCGCCCTGGTGTGGCACTTCTGCCTG 

CGCGATCGCTGGGGCTGCCCGCGCCGAGCCGCCGCCCGAGCCGCAGGGGCGCTCTGAAAG 

GGGCCTGGGGGCATCTCGGGCACAGACAGCCCCACCTGGGGCGCTCAGCCTGGCCCCCGG 

GAAAGAGGAAAACCCGCTGCCTCCAGGGAGGGCTGGACGGCGAGCTGGGAGCCAGCCCCA 

GGCTCCAGGGCCACGGCGGAGTCATGGTTCTCAGGACTGAGCGCTTGTTTAGGTCCGGTA 

CTTGGCGCTTTGTTTCCTGGCTGAGGTCTGGGAAGGAATAGAAAGGGGCCCCCAATTTTT . 

TTTTAAGCGGCCAGATAATAAATAATGTAACCTTTGCGGTTAAAAAAAAAAAAAAAAAA 
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Figure 84 

MLGSPCLLWLLAVTFLVPRAQPLAPQDFEEEEADETETAWPPLPAVPCDYDHCRHLQVPC 
KELQRVGPAACLCPGLSSPAQPPDPPRMGE^IiWVEEGRAVVHWCAPFSPVLHYWLLLWD 
GSEAAQKGPPLNATVRRAELKGLKPGGIYWCWAANEAGASRVPQAGGEGLEGADIPAF 
GPCSRLAVPPNPRTLVHAAVGVGTALALLSCAALVWHFCLRDRWGCPRRAAARAAGAL 
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Figure 85 

CGACG 

ATGCTACGCGCGCCCGGCTGCCTCCTCCGGACCTCCGTAGCGCCTGCCGCGGCCCTGGCT 
GCGGCGCTGCTCTCGTCGCTTGCGCGCTGCTCTCTTCTAGAGCCGAGGGACCCGGTGGCC 
TCGTCGCTCAGCCCCTATTTCGGCACCAAGACTCGCTACGAGGATGTCAACCCCGTGCTA 
TTGTCGGGCCCCGAGGCTCCGTGGCGGGACCCTGAGCTGCTGGAGGGGACCTGCACCCCG 
GTGCAGCTGGTCGCCCTCATTCGCCACGGCACCCGCTACCCCACGGTCAAACAGATCCGC 
AAGCTGAGGCAGCTGCACGGGTTGCTGCAGGCCCGCGGGTCCAGGGATGGCGGGGCTAGT 
AGTACCGGCAGCCGCGACCTGGGTGCAGCGCTGGCCGACTGGCCTTTGTGGTACGCGGAC 
TGGATGGAC GGGCAGCTAGTAGAGAAGGGACGGCAGGATATG CGACAGCTGGCGCTGCGT 
CTGGCCTCGCTCTTCCCGGCCCTTTTCAGCCGTGAGAACTACGGCCGCCTGCGGCTCATC 
ACCAGTTCCAAGCACCGCTGCATGGATAGCAGCGCCGCCTTCCTGCAGGGGCTGTGGCAG 
CACTACCACCCTGGCTTGCCGCCGCCGGACGTCGCAGATATGGAGTTTGGACCTCCAACA 
GTTAATGATAAACTAATGAGATTTTTTGATCACTGTGAGAAGTTTTTAACTGAAGTAGAA 
AAAAATGCTACAGCTCTTTATCACGTGGAAGCCTTCAAAACTGGACCAGAAATGCAGAAC 
ATTTTAAAAAAAGTTGCAGCTACTTTGCAAGTGCCAGTAAATGATTTAAATGCAGATTTA 
ATTCAAGTAGCCTTTTTCACCTGTTCATTTGACCTGGCAATTAAAGGTGTTAAATCTCCT 
TGGTGTGATGTTTTTGACATAGATGATGCAAAGGTATTAGAATATTTAAATGATCTGAAA 
CAATATTGGAAAAGAGGATATGGGTATACTATTAACAGTCGATCCAGCTGCACCTTGTTT 
CAGGATATCTTTCAGCACTTGGACAAAGCAGTTGAACAGAAACAAAGGTCTCAGCCAATT 
TCTTCTCCAGTCATCCTCCAGTTTGGTCATGCAGAGACTCTTCTTCCACTGCTTTCTCTC 
ATGGGCTACTTCAAAGACAAGGAACCCCTAACAGCGTACAATTACAAAAAACAAATGCAT 
CGGAAGTTCCGAAGTGGTCTCATTGTACCTTATGCCTCGAACCTGATATTTGTGCTTTAC 
CACTGTGAAAATGCTAAGACTCCTAAAGAACAATTCCGAGTGCAGATGTTATTAAATGAA 
AAGGTGTTACCTTTGGCTTACTCACAAGAAACTGTTTCATTTTATGAAGATCTGAAGAAC 
CACTACAAGGACATCCTTCAGAGTTGTCAAACCAGTGAAGAATGTGAATTAGCAAGGGCT 
AACAGTACATCTGATGAACTATGAGTAACTGAAGAACATT7T7AATTCTTTAGGAATCTG 
CAATGAGTGATTACATGCTTGTAATAGGTAGGCAATTCCTTGATTACAGGAAGCTTTTAT 
ATTACTTGAGTATTTCTGTCTTTTCACAGAAAAACATTGGGTTTCTCTCTGGGTTTGGAC 
ATGAAATGTAAGAAAAGATTTTTCACTGGAGCAGCTCTCTTAAGGAGAAACAAATCTATT 
TAGAGAAACAGCTGGCCCTGCAAATGTTTACAGAAATGAAATTCTTCCTACTTATATAAG 
AAATCTCACACTGAGATAGAATTGTGATTTCATAATAACACTTGAAAAGTGCTGGAGTAA 
CAAAATATCTCAGTTGGACCATCCTTAACTTGATTGAACTGTCTAGGAACTTTACAGATT 
GTTCTGCAGTTCTCTCTTCTTTTCCTCAGGTAGGACAGCTCTAGCATTTTCTTAATCAGG 
AATATTGTGGTAAGCTGGGAGTATCACTCTGGAAGAAAGTAACATCTCCAGATGAGAATT 
TGAAACAAGAAACAGAGTGTTGTAAAAGGACACCTTCACTGAAGCAAGTCGGAAAGTACA 
ATGAAAATAAATATTTTTGGTATTTATTTATGAAAT^ 

CTTTTTACTTCTAGGAAGTCTCAAAAGACCATCTTAAATTATTATATGTTTGGACAATTA 
GCAACAAGTCAGATAGTTAGAATCGAAGTTTTTCAAATCCATTGCTTAGCTAACriT 
ATTCTGTCACTTGGCTTCGATrri^lATAl'inn'CCTATTATATGAAATGTATCTTTTGG^ 
GTTTGATTTTTCTTTCTTTCTTTGTAAATAGTTCTGAGTTCTGTCAAATGCCGTGAAAGT 
ATTTG CTATAATAAAGAAAATTCTTGTGACTTTAAAAAAAAA 
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Figure 86 

MLRAPGCIJ^TSVAPAAAIAAALLSSI^CSLLEPRDPVASSLSPYFGTKTRyEDWPVL 
LSGPEAPWRDPELLEGTCTPVQLVALIRHGTRYPTVKQIRKLRQLHGLLQARGSRDGGAS 
STGSRDLGAALADW PLW YADWMDGQL VEKGRQDMRQLALRLAS LF PALF S R ENYGRLRL I 
TSSKHRCMDSSAAFLQGLWQHYHPGLPPPDVADMEFGPPTVNDKLMRFFDHCEKPLTE^ 
KNATAL YHVEAFKTGPEMQNI LKKVAATLQVP VNDLNADLI QVAFFTCS FDLAI KGVKS P 
WCDVFD I DDAKVLEYLNDLKQYWKRGYGYTI NSRSS CTLFQDI FQHLDKAVEQKQRSQP I 
S S P VI LQFGHAETLLPLLSLMG YFKDKEPLTAYNYKKQMHRKFRSGL I VP YASNL I FVL Y 
HCENAKTPKEQFRVQMLLNEKVLPLAYSQETVSFYEDLKNHYKDILQSCQTSEECELARA 
NSTSDEL 
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Figure 87 

GGGACTACAAGCCGCGCCGCGCTGCCGCTGGCCCCTCAGCAACCCTCGAC 
ATG GCGCTGAGGCGGCCACCGCGACTCCGGCTCTGCGCTCGGCTGCCTGACTTCTTCCTG 
CTGCTGCTTTTCAGGGGCTGCCTGATAGGGGCTGTAAATCTCAAATCCAGCAATCGAACC 
CCAGTGGTACAGGAATTTGAAAGTGTGGAA 

CTGTCTTGCATCATTACGGATTCGCAGACAAGTGACCCCAGGATCGAGTGGAAGAAAATT 
CAAGATGAACAAACCACATATGTGTTTTTTGACAACAAAATTCAGGGAGACTTGGCGGGT 
CGTGCAGAAATACTGGGGAAGACATCCCTGAAGATCTGGAATGTGACACGGAGAGACTCA 
GCCCTTTATCGCTGTGAGGTCGTTGCTCGAAATGACCGCAAGGAAATTGATGAGATTGTG 
ATCGAGTTAACTGTGCAAGTGAAG C CAGTGAC CCCTGTCTGTAGAGTGCCGAAGGCTGTA 
CCAGTAGGCAAGATGGCAACACTGCACTGCCAGGAGAGTGAGGGCCACCCCCGGCCTCAC 
TACAGCTGGTATCGCAATGATGTACCACTGCCCACGGATTCCAGAGCCAATCCCAGATTT 
CGCAATTCTTCTTTCCACTTAAACTCTGAAACAGGCACTTTGGTGTTCACTGCTGTTCAC 
AAGGACGACTCTGGGCAGTACTACTGCATTGCTTCCAATGACGCAGGCTCAGCCAGGTGT 
GAGGAGCAGGAGATGGAAGTCTATGACCTGAACATTGGCGGAATTATTGGGGGGGTTCTG 
'GTTGTCCTTGCTGTACTGGCCCTGATCACGTTGGGCATCTGCTGTGCATACAGACGTGGC 
TACTTCATCAACAATAAACAGGATGGAGAAAGTTACAAGAACCCAGGGAAACCAGATGGA 
GTTAACTACATCCGCACTGACGAGGAGGGCGACTTCAGACACAAGTCATCGTTTGTGATC 
TGAGACCCGCGGTGTGGCTGAGAGCGCACAGAGCGCACGTGCACATACCTCTGCTAGAAA 
CTCCTGTCAAGGCAGCGAGAGCTGATGCACTCGGACAGAGCTAGACACTCATTCAGAAGC 
TTTTCGTTTTGGCCAAAG1TGACCACTACTCTTCTTACTCTAACAAGCCACATGAATAGA 
AGAATTTTCCTCAAGATGGACCCGGTAAATATAACCACAAGGAAGCGAAACTGGGTGCGT 
TCACTGAGTTGGGTTCCTAATCTGTTTCTGGCCTGATTCCCGCATGAGTATTAGGGTGAT 
CTTAAAGAGTTTGCTCACGTAAACGCCCGTGCTGGGCCCTGTGAAGCCAGCATGTTCACC 
ACTGGTCGTTCAGCAGCCACGACAGCACCATGTGAGATGGCGAGGTGGCTGGACAGCACC 
AGCAGCGCATCCCGGCGGGAACCCAGAAAAGGCTTCTTACACA 

GC A .GC CTTACTTCATCGG CCCACAGAC AC CACCG CAGTTTCTT CTTAAAGG CTCTG CTGA 

TCGGTGTTGCAGTGTCCATTGTGGAGAAGCTTTTTGGATCAGCATTTTGTAAAAACAACC 

AAAATCAGGAAGGTAAATTGGTTGCTGGAAGAGGGATCTTGCCTGAGGAACCCTGCTTGT 

CCAACAGGGTGTCAGGATTTAAGGAAAACCTTCGTCTTAGGCTAAGTCTGAAATGGTACT 

GAAATATGCTTTTCTATGGGTCTTGTTTATTTTATAAAATTTTACATCTAAATTTTTGCT 

AAGGATGTATTTTGATTATTGAAAAGAAAATTTCTATTTAAACTGTAAATATATTGTCAT 

ACAATGTTAAATAACCTATTTTTTTAAAAAAGTTCAACTTAAGGTAGAAGTTCCAAGCTA 

CTAGTGTTAAATTGGAAAATATCAATAATTAAGAGTATTTTACCCAAGGAATCCTCTCAT 

GGAAGTTTACTGTGATGTTCCTTTTCTCACACAAGTTTTAGCCTTTTTCACAAGGGAACT 

CATACTGTCTACACATCAGACCATAGTTGCTTAGGAAACCTTTAAAAATT 

CCAGTTAAGCAATGTTGAAATCAGTTTGCATCTCTTCAAAAGAAACCTCTCAGGTTAGCT 

TTGAACTGCCTCTTCCTGAGATGACTAGGACAGTCTGTACCCAGAGGCCACCCAGAAGCC 

CTCAGATGTACATACACAGATGCCAGTCAGCTCCTGGGGTTGCGCCAGGCGCCCCCGCTC 

TAGCTCACTGTTGCCTCGCTGTCTGCCAGGAGGCCCTGCCATCCTTGGGCCCTGGCAGTG 

GCTGTGTCCCAGTGAGCTTTACTCACGTGGGCCTTGCTTCATCCAGCACAGCTCTCAGGT 

GGGCACTGCAGGGACA 

CTGGTGTCTTCCATGTAGCGTCCCAGCTTTGGGCTCCTGTAACAGACCTCTTTTTGGTTA 
TGGATGGCTCACAAAATAGGGCCCCCAATGCTATTTTTTTTTTTTAAGTTTGTTTAATTA 
TTTGTTAAGATTGT CTAAGGC CAAAG GCAATTGC GAAATCAAGTCTGTCAAGTACAATAA 
CATTTTTAAAAGAAAATGGATCCCACTGTTCCTCTTTGCCACAGAGAAAGCACCCAGACG 
CCACAGGCTCTGTCGCATTTCAAAACAAACCATGATGGAGTGGCGGCCAGTCCAGCCTTT 
TAAAGAACGTCAGGTGGAGCAGCCAGGTGAAAGGCCTGGCGGGGAGGAAAGTGAAACGCC 
TGAATCAAAAGCAGTTTTCTAATTTTGACTTTAAATTTTTCATCCGCCGGAGACACTGCT 
CCCATTTGTGGGGGGACATTAGCAACATJC^CTCAGAAGCCTGTGTTCTTCAAGAGCAGGT 
GTTCTCAGCCTCACATGCCCTGCCGTGCTGGACTCAGGACTGAAGTGCTGTAAAGCAAGG 
AGCTGCTGAGAAGGAGCACTCCACTGTGTGCCTGGAGAATGGCTCTCACTACTCACCTTG 
TCTTTCAGCTTCCAGTGTCTTGGGTTTTTTATACTTTGAC^GCTTTTTTTTAATTGCATA 
CATGAGACTGTGTTGACTTTTTTTAGTTATGTGAAACACTTTG C CGCAGGCCGCCTGGCA 
GAGGCAGGAAATGCTCCAGCAGTGGCTCAGTGCTCCCTGGTGTCTGCTGCATGGCATCCT 
GGATGCTTAGCATGCAAGTTCCCTCCATCATTGCCACCTTGGTAGAGAGGGATGGCTCCC 
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CACCCTCAGCGTTGGGGATTCACGCTCCAGCCTCCTTCTTGGTTGTCATAGTGATAGGGT 
AGCCTTATTGCCCCCTCTTCTTATACCCTAAAACCTTCTACACTAGTGCCATGGGAACCA 
GGTCTGAAAAAGTAGAGAGAAGTGAAAGTAGAGTCTGGGAAGTAGCTGCCTATAACTGAG 
ACTAGACGGAAAAGGAATACTCGTGTATTTTAAGATATGAATGTGACTCAAGACTCGAGG 
CCGATACGAGGCTGTGATTCTGCCTTTGGATGGATGTTGCTGTACACAGATGCTACAGAC 
TTGTACTAACACACCGTAATTTGGCATTTGTTTAACCTCATTTATAAAAGCTTCAAAAAA 

ACCCA 
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Figure 88 

MALRRPPRLRLCARLPDFFLLLLFRGCLIGAVNLKSSNRTPWQEFESVELSCIITDSQT 
SDPRIEWKKIQDEQTTYVFFDNKIQGDLAGRAEILGKTSLKIWNVTRRDSALYRCEWAR 
NDRKEIDEIVIELTVQVKPVTPVCRVPKAVPVGKMATLHCQESEGHPRPHYSWYRNDVPL 
PTDSRANPRFRNSSFHLNSETGTLVFTAVHKDDSGQYYCIASNDAGSARCEEQEMEVYDL 
NIGGIIGGVLVVIAVIiALITLGICCAYRRGYFINNKQDGESYKNPGKPDGVNYIRTDEEG 
DFRHKSSFVI 
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Figure 89 

CCCACGCGTCCGGACAAACTGGAGGTGAAAGGAGCTGGTACTGTCCACTGTGCTGTCGGT 
GCTGAACCTGAGACGCGAGCGGACCAGTTGCTCCAGCACCTGAAGGCAACGCCCTCTTGC 
ACCCTCTGTGCCCTGTGGGACCCGCTTCACCAACAGGACCCATATCAACTTGACAAAGGA 
GTGTGGTATCGGACGTGGGAGAGAGTCCTCTGTTTGCCACCTGGGCGCTCATTCAGGCGT 
GACTTTGGAGATTTCTATAGTTTTAGACC^AACTATU'l'l^l^lT'rCCCCAGCTAAGACGAT 
CTTTTGAGAGTTTTI'TITITTATTGTGATTTATATTTCCACAGCGTTTAGGAATCTTTCT 
GGGGGACTTTTGTGACTGTTAAAATAAGGTGAAAAGCAATAAGG 

ATGTTTAAGTGCTGGTCAGTTGTCTTGGTTCTCGGATTCATTTTTCTGGAGTCGGAAGGA 
AGGCCAACCAAAGAAGGAGGATATGGCCTTAAATCCTATCAGCCTCTAATGAGATTGCGA 
CATAAGCAGGAAAAAAATCAAGAAAGTTCAAGAGTCAAAGGATTTATGATTCAGGATGGC 
CCTTTTGGATCTTGTGAAAATAAGTACTGTGGTTTGGGAAGACACTGTGTTACCAGCAGA 
GAGACAGGGCAAGCAGAATGTGCCTGTATGGACCTTTGCAAACGTCACTACAAACCTGTG 
TGTGGATCTGACGGAGAATTCTATGAAAACCACTGTGAAGTGCACAGAGCTGCTTGCCTG 
AAAAAACAAAAGATTACCATTGTTCACAATGAAGACTGCTTCTTTAAAGGAGATAAGTGC 
AAGACTACTGAATACAGCAAGATGAAAAATATGCTATTAGATTTACAAAATCAAAAATAT 
ATTATGCAAGAAAATGAAAATCCTAATGGCGACGACATATCTCGGAAGAAGCTATTGGTG 
GATCAAATGTTTAAATATTTTGATGCAGACAGTAATGGACTTGTAGATATTAATGAACTA 
ACTCAGGTGATAAAACAGGAAGAACTTGGCAAGGATCTCTTTGATTGTACTTTGTATGTT 
CTATTGAAATATGATGATTTTAATGCTGACAAGCACCTGGCTCTTGAAGAATTTTATAGA 
GCATTCCAAGTGATCCAGTTGAGTCTGCCAGAAGATCAGAAACTAAGCATCACTGCAGCA 
ACTGTGGGACAAAGTGCTGTTCTGAGCTGTGCCATTCAAGGAACCCTGAGACCTCCCATT 
ATCTGGAAAAGGAACAATATTATTCTAAATAATTTAGATTTGGAAGACATCAATGACTTT 
GGAGATGATGGGTCCTTGTATATTACTAAGGTTACCACAACTCACGTTGGCAATTACACC 
TGCTATGCAGATGGCTATGAACAAGTCTATCAGACTCACATCTTCCAAGTGAATGTTCCT 
CCAGTCATCC 
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Figure 90 

MFKCWSVVLVLGFIFLESEGRPTKEGGYGLKSYQPLMRLRHKQEKNQESSRVKGFMIQDG 
PFGSCENKYCGLGRHCVTSRETGQAECACMDLCKRHYKPVCGSDGEFYENHCEVHRAACL 
KKQKIT I VHNEDCF FKGDKCKTTE YSKMKNMLLDLQNQKYI MQENENPNGDD I S RKKLLV 
DQMFKYFDADSNGLVDINELTQVIKQEELGKDLFDCTLYVLLKYDDFNADKHLALEEFYR 
AFQVIQLSLPEDQKLSITAATVGQSAVLSCAIQGTLRPPIIWKRNNIILNNLDLEDINDF 
GDDGSLYITKVTTTHVGNYTCYADGYEQVYQTHIFQVNVPPVI . . . 
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Figure 9 1 



GCAGCGAGCGCCGGGTGCGGCCCTGCCGCCGCAGGGATGTGACCTTCACCGTCGCTTAGC 

CAGGATGACCGGAGCCCGTGTCTCGCGGCGTCCGCGCCTCGCTTCAGCCTCCCGGGTGCT 

CTGACCGCACGCTCCCGGCTGCTAGGCTCCCCGGCACCGGCCTCGCC 

ATGCCGCCACCGCCCGGGCCCGCCGCCGCCCTGGGCACTGCGCTTCTGCTGCTCCTGCTG 

GCTTCCGAGTCTTCTCACACTGTGCTGTTGCGGGCGCGTGAGGCGGCGCAGTTTCTGCGG 

CCCAGGCAGCGCCGC 

GCCTACCAAGTCTTCGAGGAGGCCAAGCAGGGCCACCTGGAACGGGAGTGCGTGGAGGAG 
GTGTGCAGCAAAGAGGAGGCCAGAGAGGTGTTCGAGAACGACCCCGAGACGGAGTATTTC 
TATCCACGATATCAAGAGTGCATGAGAAAATATGGCAGGCCTGAAGAAAAAAACCCAGAT 
TTCGCCAAATGTGTTCAGAACTTGCCTGACCAGTGCACCCCAAACCCTTGTGATAAGAAG 
GGTACTCATATCTGCCAAGACCTCATGGGCAACTTCTTCTGCGTGTGCACAGATGGCTGG 
GGAGGC CGG CTCTGTGACAAAGATGTCAATGAGTGTGTCCAGAAGAATGGGGG CTGCAGC 
CAGGTCTGCCACAACAAACCAGGAAGCTTCCAATGTGCCTGCCATAGTGGCTTCTCGCTT 
GCATCAGACGGCCAGACCTG CCAAGATATCGATGAATGCACAGACTCAGACACCTGTGGG 
GACGCGCGATGCAAGAACTTGCCAGGCTCCTACTCTTGCCTCTGCGATGAGGGATATACA 
TACAGCTCCAAGGAGAAGACCTGCCAAGATGTGGACGAGTGCCAGCAGGATCGCTGTGAG 
CAGACCTGTGTCAACTCCCCAGGCAGCTATACCTGCCACTGTGATGGGCGAGGGGGCCTA 
AAACTATCCCCAGACATGGATACTTGTGAGGACATCTTACCATGTGTGCCCTTCAGCATG 
GCCAAGAGCGTGAAGTCCTTGTACCTGGGCCGCATGTTCAGCGGGACCCCCGTGATTAGA 
CTACGCTTCAAGAGGCTTCAGCCTACCAGGCTGCTGGCTGAATTTGACTTCCGCACTTTT 
GACCCTGAAGGAGTCCTCTTCTTCGCTGGAGGCCGTTCAGACAGCACCTGGATTGTCCTG 
GGCCTAAGAGCTGGGCGGCTTGAGCTGCAGCTTCGGTACAATGGCGTTGGGCGCATCACC 
AGCAGCGGGCCAACCATCAACCACGGCATGTGGCAAACTATCTCCGTGGAAGAGCTGGAA 
CGTAACCTTGTCATCAAGGTCAAC^AAGATGCTGTAATGAAGATCGCGGTAGCTGGGGAG 
CTGTTTCAGCTGGAGAGGGGCCTCTATCACCTGAATCTCACCGTGGGCGGCATTCCCTTC 
AAGGAGAGTGAGCTCGTCCAGCCGATTAACCCTCGCCTGGATGGGTGCATGAGGAGTTGG 
AACTGGCTGAACGGGGAAGACAGCGCCATCCAGGAGACAGTCAAGGCAAACACAAAAATG 
CAGTGCTTCTCTGTGACAGAAAGGGGCTCCTTCTTCCCGGGGAATGGATTTGCTACCTAC 
AGGCTCAACTACACCCGAACATCGCTGGATGTCGGCACGGAAACCACCTGGGAAGTTAAA 
GTTGTGGCTCGGATCCGCCCTGCCACGGACACGGGGGTGCTGCTGGCGCTGGTGGGGGAC 
GACGATGTCGTCATCTCTGTGGCCCTAGTCGACTACCACTCTACAAAGAAGCTCAAGAAG 
CAGTTGGTGGTCCTGGCAGTTGAGGATGTTGCCCTGGCACTGATGGAAATCAAGGTGTGC 
GACAGCCAGGAACACACGGTCACTGTCTCCCTGCGGGAGGGTGAGGCCACCCTAGAAGTG 
GATGGCACAAAGGGCCAGAGTGAAGTGAGCACTGCCCAGCTGCAGGAGCGACTGGACACA 
CTTAAGACACATCTGCAAGGCTCTGTGCACACCTATGTTGGAGGCCTGCCAGAAGTATCG 
GTGATTTCTGCACCCGTCACTGCGTTCTACCGCGGATGCATGACTCTGGAGGTAAACGGG 
AAAATCCTGGACCTGGATACGGCCTCGTACAAGCACAGTGACATCACCTCCCACTCCTGC 
CCGCCTGTGGAGCATGCCACCCCC TAGA CCGAGCTGCAAGAGGGCTCCACACCTAAAGAC 
AAAAATGAAGCAGGGTTTGGACACACAGCACTGGCTCCTCTCGCATQ3TCCTGCAA(^CT 
GGAGCAGCGTGGACCGCCCTTGTGGTTTTTTTTTCTTGAGATCTTTCTTTTTGCCTTGTA 
ACATATCTGTACATAATGGACGGGTGTCGGGTCACCGGCTGCTCAGAGAGAGCCACGTGA 
CCTGGTGGGAGCTGGCTGGAAGGGGCTGGGCTAGAGGGGCTGGCAGTTTGCAGCAGAACG 
GATGTGAAGAAAATAATTCTCTATTATTTTTATTACTACATGCTTCTTTCTGACTCTAAA 
ATATGGAAAATAAAATATTOACAGAAACCTTTTTAAAAAAAAAAAAAAAAA 
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Figure 92 

MP P P PGP AAALGTALLLLLLAS ES S HTVLLRAR EAAQ FLR PRQRRA YQVFEEAKQGHLE 
RECVEEVCSKEEAREVFENDPETEYFYPRYQECMRKYGRPEEKNPDFAKCVQNLPDQCT 
PNPCDKKGTHICQDLMGNFFCVCTDGWGGRLCDKDVNECVQKNGGCSQVCHNKPGSFQC 
ACHSGFSLASDGQTCQDIDECTDSDTCGDARCKNLPGSYSCLCDEGYTYSSKEKTCQDV 
DECQQDRCEQTCVNS PGS YTCHCDGRGGLKLS PDMDTCEDI LPCVP FSMAKS VKSL YLG 
RMFSGTPVIRLRFKRLQPTRLLAEFDFRTFDPEGVLFFAGGRSDSTWIVLGLRAGRLEL 
QLR YNGVGR I TS SGPTI NHGMWQTI S VEELERNLVI KVNKDAVMKI AVAGELFQLERGL 
YHLNLTVGGIPFKESELVQPINPRLDGCMRSWNWLNGEDSAIQETVKANTKMQCFSVTE 
RGSFFPGNGFATYRLNYTRTSLDVGTETTWEVKWARIRPATDTGVLLALVGDDDVVIS 
VALVD YHS TKKLKKQLWLAVEDVALALME I KVCDSQ EHTVTVS LR EGEATL EVDGTKG 
QSEVSTAQLQERLDTLKTHLQGSVHTYVGGLPEVSVISAPVTAFYRGCMTLEVNGKILD 
LDTASYKHSDITSHSCPPVEHATP 
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Figure 93 

AGTCGACTGCGTCCCCTGTACCCGGCGCCAGCTGTGTTCCTGACCCCAGAATAACTCAGG 
GCTGCACCGGGCCTGGCAGCGCrCCGCACACATTTCCTGTCGCGGCCTAAGGGAAACTGT 
TGGCCGCTGGGCCCGCGGGGGGATTCTTGGCAGTTGGGGGGTCCGTCGGGAGCGAGGGCG 
GAGGGGAAGGGAGGGGGAACCGGGTTGGGGAAGCCAGCTGTAGAGGGCGGTGACCGCGCT 
CCAGACACAGCTCTGCGTCCTCGAGCGGGACAGATCCAAGTTGGGAGCAGCTCTGCGTGC 
GGGGCCTCAGAGA 

ATGAGGCCGGCGTTCGCCCTGTGCCTCCTCTGGCAGGCGCTCTGGCCCGGGCCGGGCGGC 
GGCGAACACCCCACTGCCGACCGTGCTGGCTGCTCGGCCTCGGGGGCCTGCTACAGCCTG 
CACCACGCTACCATGAAGCGGCAGGCGGCCGAGGAGGCCTGCATCCTGCGAGGTGGGGCG 
CTCAGCACCGTGCGTGCGGGCGCCGAGCTGCGCGCTGTGCTCGCGCTCCTGCGGGCAGGC 
CCAGGGCCCGGAGGGGGCTCCAAAGACCTGCTGTTCTGGGTCGCACTGGAGCGCAGGCGT 
TCC^CTGCACCCTGGAGAACGAGCCTTTGCGGGGTTTCTCCTGGCTGTCCTCCGACCCC 
GGCGGTCTCGAAAGCGACACGCTGCAGTGGGTGGAGGAGCCCCAACGCTCCTGCACCGCG 
CGGAGATGCGCGGTACTCCAGGCCACCGGTGGGGTCGAGCCCGCAGGCTGGAAGGAGATG 
CGATGCCACCTGCGCGCCAACGGCTACCTGTGCAAGTACCAGTTTGAGGTCTTGTGTCCT 
GCGCCGCGCCCCGGGGCCGCCTCTAACTTGAGCTATCGCGCGCCCTTCCAGCTGCACAGC 
GCCGCTCTGGACTTCAGTCCACCTGGGACCGAGGTGAGTGCGCTCTGCCGGGGACAGCTC 
CCGATCTCAGTTACTTGCATCGCGGACGAAATCGGCGCTCGCTGGGACAAACTCTCGGGC 
GATGTGTTGTGTCCCTGCCCCGGGAGGTACCTCCGTGCTGGCAAATGCGCAGAGCTCCCT 
AACTGCCTAGACGACTTGGGAGGCTTTGCCTGCGAATGTGCTACGGGCTTCGAGCTGGGG 
AAGGACGGCCGCTCTTGTGTGACCAGTGGGGAAGGACAGCCGACCCTTGGGGGGACCGGG 
GTGCCCACCAGGCGCCCGCCGGCCACTGCAACCAGCCCCGTGCCGCAGAGAACATGGCCA 
ATCAGGGTCGACGAGAAGCTGGGAGAGACACCACTTGTCCCTGAACAAGACAATTCAGTA 
ACATCTATTCCTGAGATTCCTCGATGGGGATCACAGAGCACGATGTCTACCCTTCAAATG 
TCCCTTCAAGCCGAGTCAAAGGCCACTATCACCCCATCAGGGAGCGTGATTTCCAAGTTT 
AATTCTACGACTTCCTCTGCCACTCCTCAGGC7TTCGACTCCTCCTCTGCCGTGGTCTTC 
ATATTTGTGAGCACAGCAGTAGTAGTGTTGGTGATCTTGACCATGACAGTACTGGGGCTT 
GTCAAGCTCTGCTTTCACGAAAGCCCCTCTTCCCAGCCAAGGAAGGAGTCTATGGGCCCG 
CCGGGCCTGGAGAGTGATCCTGAGCCCGCTGCTTTGGGCTCCAGTTCTGCACATTGCACA 
AACAATGGGGTGAAAGTCGGGGACTGTGATCTGCGGGACAGAGCAGAGGGTGCCTTGCTG 
GCGGAGTCCCCTCTTGGCTCTAGTGATGC ATAG GGAAACAGGGGACATGGGCACTCCTGT 
GAACAGTTTTTCACTTTTGATGAAACGGGGAACCAAGAGGAA 

CTTACTTGTGTAACTGACAATTTCTGCAGAAATCCCCCTTCCTCTAAATTCCCTTTACTC 
CACTGAGGAGCTAAATCAGAACTGCACACTCCTTCCCTGATGATAGAGGAAGTGGAAGTG 
CCTTTAGGATGGTGATACTGGGGGACCGGGTAGTGCTGGGGAGAGATATTTTCTTATGTT 
TATTCGGAGAATTTGGAGAAGTGATTGAACTTTTCAAGACAT^ 

AATATAATTTACATTAAAAAATAATTTCTACCAAAATGGAAAGGAAATGTTCTATGTTGT 
TCAGGCTAGGAGTATATTGGTTCGAAATCCCAGGGAAAAAAATAAAAATAAAAAATTAAA 
GGATTGTTGAT 
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Figure 94 

MRPAFALCLLWQALWPGPGGGEHPTADRAGCSASGACYSLHHATMKRQAAEEACILRGGA 
LSTVRAGAELRAVLALLRAGPGPGGGSKDLLFWVALERRRSHCTLENEPLRGFSWLSSDP 
GGLESDTLQWVEEPQRSCTARRCAVLQATGGVEPAGWKEMRCHLRANGYLCKYQFEVLCP 
APR PGAASNLS YRAP FQLHSAALDFS PPGTEVSALCRGQL PISVTCIADEI GARWDKLSG 
DVLCPCPGRYLRAGKCAELPNCLDDLGGFACECATGFELGKDGRSCVTSGEGQPTLGGTG 
VPTRR P PATATS P VPQRTWP I RVDEKLGETPLVPEQDNSVTS I PE I PRWGSQSTMSTLQM 
SLQAESKATITPSGSVISKFNSTTSSATPQAFDSSSAWFIFVSTAWVLVILTMTVLGL 
VKLCFHESPSSQPRKESMGPPGLESDPEPAALGSSSAHCTNNGVKVGDCDLRDRAEGALL 
AESPLGSSDA 
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Figure 95 

GACTAGTTCTCTTGGAGTCTGGGAGGAGGAAAGCGGAGCCGGCAGGGAGCGAACCAGGAC 

TGGGGTGACGGCAGGGCAGGGGGCGCCTGGCCGGGGAGAAGCGCGGGGGCTGGAGCACCA 

CCAACTGGAGGGTCCGGAGTAGCGAGCGCCCCGAAGGAGGCCATCGGGGAGCCGGGAGGG 

GGGACTGCGAGAGGACCCCGGCGTCCGGGCTCCCGGTGCCAGCGCT 

ATGAGGCCACTCCTCGTCCTGCTGCTCCTGGGCCTGGCGGCCGGCTCGCCCCCACTGGAC 

GACAACAAGATCCCCAGCCTCTGCCCGGGGCACCCCGGCCTTCCAGGCACGCCGGGCCAC 

CATGGCAGCCAGGGCTTGCCGGGCCGCGATGGCCGCGACGGCCGCGACGGCGCGCCCGGG 

GCTCCGGGAGAGAAAGGCGAGGGCGGGAGGCCGGGACTGCCGGGACCTCGAGGGGACCCC 

GGGCCGCGAGGAGAGGCGGGACCCGCGGGGCCCACCGGGCCTGCCGGGGAGTGCTCGGTG 

CCTCCGCGATCCGCCTTCAGCGCCAAGCGCTCCGAGAGCCGGGTGCCTCCGCCGTCTGAC 

GCACCCTTGCCCTTCGACCGCGTGCTGGTGAACGAGCAGGGACATTACGACGCCGTCACC 

GGCAAGTTCACCTGCCAGGTGCCTGGGGTCTACTACTTCGCCGTCCATG'CCACCGTCTAC 

CGGGCCAGCCTGCAGTTTGATCTGGTGAAGAATGGCGAATCCATTGCCTCTTTCTTCCAG 

TTTTTCGGGGGGTGGCCCAAGCCAGCCTCGCTCTCGGGGGGGGCCATGGTGAGGCTGGAG 

CCTGAGGACCAAGTGTGGGTGCAGGTGGGTGTGGGTGACTACATTGGCATCTATGCCAGC 

ATCAAGACAGACAGCACCTTCTCCGGATTTCTGGTGTACTCCGACTGGCACAGCTCCCCA 

GTCTTTGCTTAGTGCCCACTGCAAAGTGAGCTCATGCTCTCACTCCTAGAAGGAGGGTGT 

GAGGCTGACAACCAGGTCATCCAGGAGGGCTGGCCCCCCTGGAATATTGTGAATGACTAG 

GGAGGTGGGGTAGAGCACTCTCCGTCCTGCTGCTGGCAAGGAATGGGAACAGTGGCTGTC 

TGCGATCAGGTCTGGCAGCATGGGGCAGTGGCTGGATTTCTGCCCAAGACCAGAGGAGTG 

TGCTGTGCTGGCAAGTGTAAGTCCCCCAGTTGCTCTGGTCCAGGAGCCCACGGTGGGGTG 

CTCTCTTCCTGGTCCTCTGCTTCTCTGGATCCTCCCCACCCCCTCCTGCTCCTGGGGCCG 

GCCCTTTTCTCAGAGATCACTCAATAAACCTAAGAACCCTCATAAAAAAAAAAAAAAAAA 

AAAAAAAAAAA 
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Figure 96 

MRPLLVLLLLGLAAGS PPLDDNKI PSLCPGHPGLPGTPGHHGSQGLPGRDGRDGRDGAPG 
APGEKGEGGRPGLPGPRGDPGPRGEAGPAGPTGPAGECSVPPRSAFSAKRSESRVPPPSD 
APLPFDRVLVNEQGHYDAVTGKFTCQVPGVYYFAVHATVYRASLQFDLVKNGESIASFFQ 
FFGGWPKPASLSGGAMVRLEPEDQVWVQVGVGDYIGIYASIKTDSTFSGFLVYSDWHSSr- 
VFA 
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Figure 97 

TG 

CCCCTGCTGCTGCTGCCCCTGCTGTGGGGGGGGTCCCTGCAGGAGAAGCCAGTGTACGAG 
CTGCAAGTGCAGAAGTCGGTGACGGTGCAGGAGGGCCTGTGCGTCCTTGTGCCCTGCTCC 
TTCTCTTACCCCTGGAGATCCTGGTATTCCTCTCCCCCACTCTACGTCTACTGGTTCCGG 
GACGGGGAGATCCCATACTACGCTGAGGTTGTGGCCACAAACAACCCAGACAGAAGAGTG 
AAGCCAGAGACCCAGGGCCGATTCCGCCTCCTTGGGGATGTCCAGAAGAAGAACTGCTCC 
CTGAGCATCGGAGATGCCAGAATGGAGGACACGGGAAGCTATTTCTTCCGCGTGGAGAGA 
GGAAGGGATGTAAAATATAGCTACCAACAGAATAAGCTGAACTTGGAGGTGACAGCCCTG 
ATAGAGAAACCCGACATCCACTTTCTGGAGCCTCTGGAGTCCGGCCGCCCCACAAGGCTG 
AGCTG'CAGCCTTCCAGGATCCTGTGAAGCGGGACCACCTCTCACATTCTCCTGGACGGGG 
AATGGCCTCAGCCCCCTGGACCCTGAGACCACCCGCTCCTCGGAGCTCACCCTCACCCCC 
AGGCCCGAGGACCATGGCACCAACCTCACCTGTCAGGTGAAACGCCAAGGAGCTCAGGTG 
ACCACGGAGAGAACTGTCCAGCTCAATGTCTCCTATGCTCCACAGAACCTCGCCATCAGC 
ATCTTCTTCAGAAATGGCACAGGCACAGCCCTGCGGATCCTGAGCAATGGCATGTCGGTG 
CCCATCCAGGAGGGCCAGTCCCTGTTCCTCGCCTGCACAGTTGACAGCAACCCCCCTGCC 
TCACTGAGCTGGTTCCGGGAGGGAAAAGCCCTCAATCCTTCCCAGACCTCAATGTCTGGG 
ACCCTGGAGCTGCCTAACATAGGAGCTAGAGAGGGAGGGGAATTCACCTGCCGGGTTCAG 
CATCCGCTGGGCTCCCAGCACCTGTCCTTCATCCTTTCTGTGCAGAGAAGCTCCTCTTCC 
TGCATATGTGTAACTGAGAAACAGCAGGGTTCCTGGCCCCTCGTCCTCACCCTGATCAGG 
GGGGCTCTCATGGGGGCTGGCTTCCTCCTCACCTATGGCCTCACCTGGATCTACTATACC 
AGGTGTGGAGGCCCCCAGCAGAGCAGGGCTGAGAGGCCTGGCTGAGCCCCTCCCGCTCAA 
GACAGAATTGAG GTGTG GACACTTAGCCCTGTGGGACACATG CAGGACATCACTGTCAGC 
TTCTTTCTGGAAGCTCACATCCCACTGACTACCCCTCTTTTCCTTCCTGCCCCATACCCC 
TTCTACTTATTCCCCTCTGCTTGTGAGTCTTGCCCCANCACACCTGCATCCCCATCTGCA 
NCCCATCCCCTCTCCANCTGCCCTTCTCTTCCCTCTCCATCCANCATCTCCAGCCCTGTG 
AAGGGAATGTACTTTCGGTCCTATACCCCATTACCATTACCAAAAGTTACCTTTTTTTTT 
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Figure 98 

PLLLLPLLWGGSLQEKPVYELQVQKSVTVQEGLCVLVPCSFSYPWRSWYSSPPLYVYWFR 
DGEI PYYAEWATNNPDRRVKPETQGRFRLLGDVQKKNCSLS IGDARMEDTGSYFFRVER 
GRDVKYSYQQNKLNLEVTALIEKPDIHFLEPLESGRPTRLSCSLPGSCEAGPPLTFSWTG 
NALSPLDPETTRSSELTLTPRPEDHGTNLTCQVTO^t^AQVTTERTVQLNVSYAPQNLAIS 
IFFRNGTGTALRILSNGMSVPIQEGQSLFLACTVDSNPPASLSWFREGKALNPSQTSMSG 
TLELPNIGAREGGEFTCRVQHPLGSQHLSFILSVQRSSSSCICVTEKQQGSWPLVLTLIR 
GALMGAG FLLTYGLTW I YYTRCGGPQQSRAERPG 
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Figure 99 

TTCGTGACCCTTGAGAAAAGAGTTGGTGGTAAATGTGCCACGTCTTCTAAGAAGGGGGAG 

TCCTGAACTTGTCTGAAGCCCTTGTCCGTAAGCCTTGAACTACGTTCTTAAATCTATGAA 

GTCGAGGGACCTTTCGCTGCTTTTGTAGGGACTTCTTTCCTTGCTTCAGCAAC 

ATGAGGCTTTTCTTGTGGAACGCGGTCTTGACTCTGTTCGTCACTTCTTTGATTGGGGCT 

TTGATCCCTGAACCAGAAGTGAAAATTGAAGTTCTCCAGAAGCCATTCATCTGCCATCGC 

AAGACCAAAGGAGGGGATTTGATGTTGGTCCACTATGAAGGCTACTTAGAAAAGGACGGC 

TCCTTATTTCACTCCACTCACAAACATAACAATGGTCAGCCCATTTGGTTTACCCT 

ATCCTGGAGGCTCTCAAAGGTTGGGACCAGGGCTTGAAAGGAATGTGTGTAGGAGAGAAG 

AGAAAGCTCATCATTCCTCCTGCTCTGGGCTATGGAAAAGAAGGAAAAGGTAAAATTCCC 

CCAGAAAGTACACTGATATTTAATATTGATCTCCTGGAGATTCGAAATGGACCAAGATCC 

CATGAATCATTCCAAGAAATGGATCTTAATGATGACTGGAAACTCTCTAAAGATGAGGTT 

AAAGCATATTTAAAGAAGGAGTTTGAAAAACATGGTGCGGTGGTGAATGAAAGTCATCAT 

GATGCTTTGGTGGAGGATATTTTTGATAAAGAAGATGAAGACAAAGATGGGTTTATATCT 

GCCAGAGAATTTACATATAAACACGATGAGTTATAGAGATACATCTACCCTTTTAATATA 

GCACTCATCTTTCAAGAGAGGGCAGTCATCTTTAAAGAACATTTTATTTTTATACAATGT 

TCTTTCTTGCTTTGTTTTTTATTTTTATATATTTTTTCTGACTCCTATTTAAAGAACCCC 

TTAGGTTTCTAAGTACCCATTTCTTTCTGATAAGTTATTGGGAAGAAAAAGCTAATTGGT 

CTTTGAATAGAAGACTTCTGGACAATTTTTCACTTTCACAGATATGAAGCTTTGTTTTAC 

TTTCTCACTTATAAATTTAAAATGTTGCAACTGGGAATATACCACGACATGAGACCAGGT 

TATAGCACAAATTAGCACCCTATATTTCTGCTTCCCTCTATTTTCTCCAAGTTAGAGGTC 

AACATTTGAAAAGCCTTTTGCAATAGCCCAAGGCTTGCTATTTTCATGTTATAATGAAAT 

AGTTTATGTGTAACTGGCTCTGAGTCTCTGCTTGAGGACCAGAGGAAAATGGTTGTTGGA 

CCTGACTTGTTAATGGCTACTGCTTTACTAAGGAGATGTGCAATGCTGAAGTTAGAAACA 

AGGTTAATAGCCAGGCATGGT 

GGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGCGGATCACCTGAGGTTG 
GGAGTTCGAGACCAGCCTGACCAACACGGAGAAACCCTATCTCTACTAAAAATACAAAGT 
AGCCCGGCGTGGTGATGCGTGCCTGTAATCCCAGCTACCCAGGAAGGCTGAGGCGGCAGA 
ATCACTTGAACCCGAGGCCGAGGTTGCGGTAAGCCGAGATCACCTNCAGCCTGGACACTC 
TGTCTCGAAAAAAGAAAAGAACACGGTTAATACCATATNAATATGTATGCATTGAGACAT 
GCTACCTAGGACTTAAGCTGATGAAGCTTGGCTCCTAGTGATTGGTGGCCTATTATGATA 
AATAGGACAAATCATTTATGTGTGAGTTTCTTTGTAATAAAATGTATCAATATGTTATAG 
ATGAGGTAGAAAGTTATATTTATATTCAATATTTACTTCTTAAGGCTAGCGGAATATCCT 
TCCTGGTTCTTTAATGCK3TAGTCTATAGTATATTATACTACAATAACATTGTATCATAAG 
ATAAAGTAGTAAACCAGTCTACATTTTCCCATTTCTGTCTCATCAAAAACTGAAGTTAGC 
TGGGTGTGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGGGCCAAGGAGGGTGGATCAC 
TTGAGATCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCTTGTCTCTACTAAAA 
ATACAAAAATTAGCCAGGCGTGGTGGTGCACACCTGTAGTCCCAGCTACTCGGGAGGCTG 
AGACAGGAGATTTGCTTGAACCCGGGAGGCGGAGGTTGCAGTGAGCCAAGATTGTGCCAC 
TGCACTCCAGCCTGGGTGACAGAGCAAGACTCCATCTCAAAAAAAAAAAAAAGAAGCAGA 
CCTACAGCAGCTACTATTGAATAAATACCTATCCTGGATTTT 
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Figure 100 



MRLFLWNAVLTLFVTSLIGALI PEPEVKI EVLQKPFICHRKTKGGDLMLVHYEGYLEKDG 
SLFHSTHKHNNGQPIWFTLGILEALKGWDQGLKGMCVGEKRKLI I PFALGYGKEGKGKIP 
PESTLIFNIDLLEIRNGPRSHESFQEMDLNDDWKLS?CDEVKAYLKiCEFEKHGAVVNESHH 
DALVEDI FDKEDEDKDGFI SAREFTYKHDEL 
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Figure 101 

GAATTCCGGGCCCCAGGATGCCAACTTTGAATAGG 
ATGAAGACTAC 

AACTTGTTCCCTTCTCATCTGCATCTCCCTGCTCCAGCTGATGGTCCCAGTGAATACTGA 
TGAGACCATAGAGATTATCGTGGAGAATAAGGTCAAGGAACTTCTTGCCAATCCAGCTAA 
CTATCCCTCCACTGTAACGAAGACTCTCTCTTGCACTAGTGTCAAGACTATGAACAGATG 
GGCCTCCTGCCCTGCTGGGATGACTGCTACTGGGTGTGCTTGTGGCTTTGCCTGTGGATC 
TTGGGAGATCCAGAGTGGAGATACTTGCAACTGCCTGTGCTTACTCGTTGACTGGACCAC 
TGCCCGCTGCTGCCAACTGTCC TAA GAATGAAGAGGTGGAGAACCCAGCTTTGATATGAT 
GAATCTAACAAAAACTGCAGTCTCAATTTGGAAATCTGACTCATGTGCCTTTAAATGTGT 
TCATATTGCCCATTTACCCTGCTTCTTGAAATGCTTCTTGAAAAATAAAGACAAATTTGC 
ATGTG • 
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Figure 102 



MKTTTCSLLICISLLQLMVPVOTDETIEIIVENKVKELIjANPAI^PSTVTKTLSCTSVK 
TMNRWASCPAGMTATGCACGFACGSWEIQSGDTCNCLCLLVDWTTARCCQLS 



105 / 133 



WO 00/53758 



PCTAJS00/05841 



Figure 103 

GCCAGGGGAAGAGGGTGATCCGACCCGGGGAAGGTCGCTGGGCAGGGCGAGTTGGGAAAG 
CGGCAGCCCCCGCCGCCCCCGCAGCCCCTTCTCCTCCTTTCTCCC^CGTCCTATCTGCCT 
CTCGCTGGAGGCCAGGCCGTGCAGCATCGAAGACAGGAGGAACTGGAGCCTCATTGGCCG 
GCCCGGGGCGCCGGCCTCGGGCTTAAATAGGAGCTCCGGGCTCTGGCTGGGACCCGACCG 
CTGCCGGCCGCGCTCCCGCTGCTCCTGCCGGGTG 

ATGGAAAACCCCAGCCCGGCCGCCGCCCTGGGCAAGGCCCTCTGCGCTCTCCTGCTGGCC 
ACTCTCGGCGCCGCCGGCCAGCCTCTTGGGGGAGAGTCCATCTGTTCCGCCAGAGCCCCG 
GCCAAATACAGCATCACCTTCACGGGCAAGTGGAGCCAGACGGCCTTCCCCAAGCAGTAC 
CCCCTGTTCCGCCCCCCTGCGCAGTGGTCTTCGCTGCTGGGGGCCGCGCATAGCTCCGAC 
TACAGCATGTGGAGGAAGAACCAGTACGTCAGTAACGGGCTGCGCGACTTTGCGGAGCGC 
GGCGAGGCCTGGGCGCTGATGAAGGAGATCGAGGCGGCGGGGGAGGCGCTGCAGAGCGTG 
CACGAGGTGTTTTCGGCGCCCGCCGTCCCCAGCGGCACCGGGCAGACGTCGGCGGAGCTG 
GAGGTGCAGCGCAGGCACTCGCTGGTCTCGTTTGTGGTGCGCATCGTGCCCAGCCCCGAC 
TGGTTCGTGGGCGTGGACAGCCTGGACCTGTGCGACGGGGACCGTTGGCGGGAACAGGCG 
GCGCTGGACCTGTACCCCTACGACGCCGGGACGGACAGCGGCTTCACCTTCTCCTCCCCC 
AACTTCGCCACCATCCCGCAGGACACGGTGACCGAGATAACGTCCTCCTCTCCCAGCCAC 
CCGGCCAACTCCTTCTACTACCCGCGGCTGAAGGCCCTGCCTCCCATCGCCAGGGTGACA 
CTGCTGCGGCTGCGACAGAGCCCCAGGGCCTTCATCCCTCCCGCCCCAGTCCTGCCCAGC 
AGGGACAATGAGATTGTAGACAGCGCCTCAGTTCCAGAAACGCCGCTGGACTGCGAGGTC 
TCCCTGTGGTCGTCCTGGGGACTGTGCGGAGGCCACTGTGGGAGGCTCGGGACCAAGAGC 
AGGACTCGCTACGTCCGGGTCCAGCCCGCCAACAACGGGAGCCCCTGCCCCGAGCTCGAA 
GAAGAGGCTGAGTGCGTCCCTGATAACTGCGTCTAAGACCAGAGCCCCGCAGCCCCTGGG 
GCCCCCCGGAGCCATGGGGTGTCGGGGGCTCCTGTGCAGGCTCATGCTGCAGGCGGCCGA 
GGGCACAGGGGGTTTCGCGCTGCTCCTGACCGCGGTGAGGCCGCGCCGACCATCTCTGCA 
CTGAAGGGCCCTCTGGTGGCCGGCACGGGCATTGGGAAACAGCCTCCTCCTTTCCCAACC 
TTGCTTCTTAGGGGCCCCCGTGTCCCGTCTGCTCTCAGCCTCCTCCTCCTGCAGGATAAA 
GTCATCCCCAAGGCTCCAGCTACTCTAAATTATGTCTCCTTATAAGTTATTGCTGCTCCA 
GGAGATTGTCCTTCATCGTCCAGGGGCCTGGCTCCCACGTGGTTGCAGATACCTCAGACC 
TGGTGCTCTAGGCTGTGCTGAGCCCACTCTCCCGAGGGCGCATCCAAGCGGGGGCCACTT 
GAGAAGTGAATAAATGGGGCGGTTTCGGAAGCGTCAGTGTTTCCATGTTATGGATCTCTC 
TGCGTTTGAATAAAGACTATCTCTGTTGCTCACAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA - 
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Figure 104 

MENPS PAAALGKALCALLLATLGAAGQPLGGESICSARAPAKYS ITFTGKWSQTAFPKQY 
PLFRP PAQWS S LLGAAHS S D YSMWRKNQYVSNGLRDFAERG EAWALMKE I EAAGEALQS V 
HEVFSAPAVPSGTGQTSAELEVQRRHSLVSFWRIVPSPDWFVGVDSLDLCDGDRWREQA 
ALDLYPYDAGTDSGFTFSSPNFATIPQDTVTEITSSSPSHPANSFYYPRLKALPPIARVT 
LLRLRQSPRAFIPPAPVLPSRDNEIVDSASVPETPLDCEVSLWSSWGLCGGHCGRLGTKS 
RTRYVRVQPANNGSPCPELEEEAECVPDNCV 
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Figure 105 

GCCTAGCCAGGCCAAGA 

ATGCAATTGCCCCGGTGGTGGGAGCTGGGAGACCCCTGTGCTTGGACGGGACAGGGTCGG 
GGGACACGCAGG 

ATGAGCCCCGCGACCACTGG CACATTCTTGCTGACAGTGTACAGTATTTTCTC CAAGGTA 
CACTCCGATCGGAATGTATACCCATCAGCAGGTGTCCTCTTTGTTCATGTTTTGGAAAGA 
GAATATTTTAAGGGGGAATTTCCACCTTACCCAAAACCTGGCGAGATTAGTAATGATCCC 
ATAACATTTAATACAAATTTAATGGGTTAC CCAGACCGACCTGGATGGCTT CGATATATC 
CAAAGGACACCATATAGTGATGGAGTCCTATATGGGTCCCCAACAGCTGAAAATGTGGGG 
AAGCCAACAATCATTGAGATAACTGCCTACAACAGGCGGACCTTTGAGACTGCAAGGCAT 
AATTTGATAATTAATATAATGTCTGCAGAAGACTTCCCGTTGCCATATCAAGCAGAATTC 
TTCATTAAGAATATGAATGTAGAAGAAATGTTGGCCAGTGAGGTTCTTGGAGACTTTCTT 
GGCGCAGTGAAAAATGTGTGGCAGCCAGAGCGCCTGAACGCCATAAACATCACATCGGCC 
CTAGACAGGGGTGGCAGGGTGCCACTTCCCATTAATGACCTGAAGGAGGGCGTTTATGTC 
ATGGTTGGTG CAGATGTC C CGTTTTCTTCTTGTTTACGAGAAGTTGAAAATCCACAGAAT 
CAATTGAGATGTAGTCAAGAAATGGAGCCTGTAATAACATGTGATAAAAAATTTCGTACT 
CAATTTTACATTGACTGGTGCAAAATTTCATTGGTTGATAAAACAAAGCAAGTGTCCACC 
TATCAGGAAGTGATTCGTGGAGAGGGGATTTTACCTGATGGTGGAGAATACAAACCCCCT 
TCTGATTCTTTGAAAAGCAGAGACTATTACACGGATTTCCTAATTACACTGGCTGTGCCC 
TCGGCAGTGGCACTGGTCCTTTTTCTAATACTTGCTTATATCATGTGCTGCCGACGGGAA 
GGCGTGGAAAAGAGAAACATGCAAACACCAGACATCCAACTGGTCCATCACAGTGCTATT 
CAGAAATCTACCAAGGAGCTTCGAGACATGTCCAAGAATAGAGAGATAGCATGGCCCCTG 
TCAACGCTTCCTGTGTTCCACCCTGTGACTGGGGAAATCATACCTCCTTTACACACAGAC 
AACTATGATAGCACAAACATGCCATTGATGCAAACGCAGCAGAACTTGCCACATCAGACT 
CAGATTCCCCAACAGCAGACTACAGGTAAATGGTATCCC TGAA GAAAGAAAACTGACTGA 
AGCAATGAATTTATJWiTCAGACAA 

AATGCATGAGCTTTTCTGG CATATGTTATG CATGTTGGCAGTATTAAGTGTATACCAAAT 
AATACAACATAACTTTCATTTTACTAATGTATTTTTTTGTACTTAAAGCATTTTTGACAA 
TTTGTAAAACATTGATGACTTTATATTTGTTACAATAAAAGTTGATCTTTAAAATAAATA 
TTATTAATGAAGCCTAAAAAAAAAAA 
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Figure 106 

MQLPRWWELGDPCAWTGQGRGTRRMS PATTGTFLLTVYS I FSKVHSDRNVYPSAGVLFVH 
VLEREYFKGEFPPYPKPGEISNDPITFNTNLMGYPDRPGWLRYIQRTPYSDGVLYGSPTA 
ENVGKPT 1 1 E I TAYNRRTFETARHNL 1 1 NI MS AEDFPL PYQAEFF I KNMNVEEMLASEVL 
GDFLGAVKNVWQPERLNAINITSALDRGGRVPLPINDLKEGVYVMVGADVPFSSCLREVE 
NPQNQLRCSQEMEPVITCDKKFRTQFYIDWCKISLVDKTKQVSTYQEVIRGEGILPDGGE 
YKPPSDSLKSRDYYTDFLITLAVPSAVALVLFLILAYIMCCRREGVEKRNMQTPDIQLVH 
HSAIQKSTKELRDMSKNREIAWPLSTLPVFHPVTGEIIPPLHTDNYDSTNMPLMQTQQNL 
PHQTQI PQQQTTGKWYP 
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Figure 107 

ATTCTCCTAGAGCATCTTTGGAAGC 

ATGAGGCCACGATGCTGCATCTTGGCTCTTGTCTGCTGGATAACAGTCTTCCTCCTCCAG 
TGTTCAAAAGGAACTACAGACGCTCCTGTTGGCTCAGGACTGTGGCTGTGCCAGCCGACA 
CCCAGGTGTGGGAACAAGATCTACAACCCTTCAGAGCAGTGCTGTTATGATGATGCCATC 
TTATCCTTAAAGGAGACCCGCCGCTGTGGCTCCACCTGCACCTTCTGGCCCTGCTTTGAG 
CTCTGCTGTCCCGAGTCTTTTGGCCCCCAGCAGAAGTTTCTTGTGAAGTTGAGGGTTCTG 
GGTATGAAGTCTCAGTGTCACTTATCTCCCATCTCCCGGAGCTGTACCAGGAACAGGAGG 
CACGTCCTGTACCC ATAAA AACCCCAGGCTCCACTGGCAGACGGCAGACAAGGGGAGAAG 
AGACGAAGCAGCTGGACATCGGAGACTACAGTTGAACTTCGGAGAGAAGCAACTTGACTT 
CAGAGGGATGGCTCAATGACATAGCTTTGGAGAGGAGCCCAGCTGGGGATGGCCAGACTT 
CAGGGGAAGAATGCCTTCCTGCTTCATCCCCTTTCCAGCTCCCCTTCCCGCTGAGAGCCA 
CTTTCATCGGCAATAAAATCCCCCACATTTACCATCT 
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Figure 108 

MRPRCCILALVCTITVFLLQCSKGTTDAPVGSGLWLCQPTPRCGNKIYNPSEQCCYDDAI 
LSLKETRRCGSTCTFWPCFELCCPESFGPQQKFLVKLRVLGMKSQCHLSPISRSCTRNRR 
HVLYP 
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Figure 109 

GGCCGCCTGGAATTGTGGGAGTTGTGTCTGCCACTCGGCTGCCGGAGGCCGAAGGTCCGT 
GACT 

ATGG CTCCCCAGAGCCTGCCTTCATCTAGGATGGCTCCTCTGGGCATGCTGCTTGGGCTG 
CTGATGGCCGCCTGCTTCACCTTCTGCCTCAGTCATCAGAACCTGAAGGAGTTTGCCCTG 
ACCAACCCAGAGAAGAGCAGCACCAAAGAAACGGAGAGAAAAGAAACCAAAGCCGAGGAG 
GAGCTGGATGCCGAAGTCCTGGAGGTGTTCCACCCGACGCATGAGTGGCAGGCCCTTCAG 
CCAGGGCAGGCTGTCCCTGCAGGATCCCACGTACGGCTGAATCTTCAGACTGGGGAAAGA 
GAGGCAAAACTCCAATATGAGGACAAGTTCCGAAATAATTTGAAAGGCAAAAGGCTGGAT 
ATCAACACCAACACCTACACATCTCAGGATCTCAAGAGTGCACTGGCAAAATTCAAGGAG 
GGGGCAGAGATGGAGAGTTCAAAGGAAGACAAGGCAAGG CAGGCTGAGGTAAAGCGG CTC 
TTCCGCCCCATTGAGGAACTGAAGAAAGACTTTGATGAGCTGAATGTTGTCATTGAGACT 
GACATGCAGATCATGGTACGGCTGATCAACAAGTTCAATAGTTCCAGCTCCAGTTTGGAA 
GAGAAGATTGCTGCGCTCTTTGATCTTGAATATTATGTCCATCAGATGGACAATGCGCAG 
GACCTGCTTTCCTTTGGTGGTCTTCAAGTGGTGATCAATGGGCTGAACAGCACAGAGCCC 
CTCGTGAAGGAGTATGCTGCGTTTGTGCTGGGCGCTGCCTTTTCCAGCAACCCCAAGGTC 
CAGGTGGAGGCCATCGAAGGGGGAGCCCTGCAGAAGCTGCTGGTCATCCTGGCCACGGAG 
CAGCCGCTCACTGCAAAGAAGAAGGTCCTGTTTGCACTGTGCTCCCTGCTGCGCCACTTC 
CCCTATGCCCAGCGGCAGTTCCTGAAGCTCGGGGGGCTGCAGGTCCTGAGGACCCTGGTG 
CAGGAGAAGGGCACGGAGGTGCTCGCCGTGCGCGTGGTCACACTGCTCTACGACCTGGTC 
ACGGAGAAGATGTTCGCCGAGGAGGAGGCTGAGCTGACCCAGGAGATGTCCCCAGAGAAG 
CTGCAGCAGTATCGCCAGGTACACCTCCTGCCAGGCCTGTGGGAACAGGGCTGGTGCGAG 
ATCACGGCCCACCTCCTGGCGCTGCCCGAGCATGATGCCCGTGAGAAGGTGCTGCAGACA 
CTGGGCGTCCTCCTGACCACCTGCCGGGACCGCTACCGTCAGGACCCCCAGCTCGGCAGG 
ACACTGGCCAGCCTGCAGGCTGAGTACCAGGTGCTGGCCAGCCTGGAGCTGCAGGATGGT 
GAGGACGAGGGCTACTTCCAGGAGCTGCTGGGCTCTGTCAACAGCTTGCTGAAGGAGCTG 
AGATGAGGCCCCACACCAGGACTGGACTGGGATGCCGCTAGTGAGGCTGAGGGGTGCCAG 
CGTGGGTGGGCTTCTCAGGCAGGAGGACATCTTGGCAGTGCTGGCTTGGCCATTAAATGG 
AAACCTGAAGGCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Figure 110 

MAPQSLPSSRMAPLGMLLGLLMAACFTFCLSHQNLKEFALTNPEKSSTKETERKETKAEE 
ELDAEVLEVFHPTHEWQALQPGQAVPAGSHVRLNLQTGEREAKLQYEDKFRNNLKGKRLD 
INTNTYTSQDLKSAIiAKFKEGAEMESSKEDKARQAEVKRLFRPIEELKKDFDELNWIET 
DMQIMVRLINKFNSSSSSLEEKIAALFDLEYYVHQMDNAQDLLSFGGLQWINGLNSTEP 
LVKEYAAFVLGAAFSSNPKVQVEAIEGGALQKLLVILATEQPLTAKKKVLFALCSLLRHF 
PYAQRQFLKLGGLQVLRTLVQEKGTEVLAVRVVTLLYDLVTEKMFAEEEAELTQEMSPEK 
LQQYRQVHLIiPGLWEQGWCEITAHLLALPEHDAREKVLQTLGVLLTTCRDRYRQDPQLGR 
TLASLQAEYQVLASLELQDGEDEGYFQELLGSVNSLLKELR 
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Figure 111 

AATATATCATCTATTTATCATTAATCAATAATGTATTCTTTTATTCCAATAACATTTGGG 
TTTTGGGATTTTAATTTTCAAACACAGCAGA 

ATGA CATTTTTTCTGTCACTATTATTATTGTTGGTATGTGAAGCTATTTGGAGATCCAftT 

TCAGGAAGCAACACATTGGAGAATGGCTACTTTCTATCAAGAAATAAAGAGAACCACAGT 

CAACC CACACAATCAT CTTTAGAAGACAGTGTGACTC CTACCAAAGCTGTCAAAACCACA ' 

GGCAAGGGCATAGTTAAAGGACGGAATCTTGACTCAAGAGGGTTAATTCTTGGTGCTGAA 

GCCTGGGGCAGGGGTGTAAAGAAAAACAC TTAGA TTCAATGATTGTAAATTTAAGGCAAA - 

TACACATATTAGTATTACCTTAGTGTAATGTATCCCTGTCATATATACAATAAGGTGAAA 

TTATAAGTACCCTATGCAGTTGGCTGGAC 

AGTTCTAAATTGGACTTTATTAATTTTTAAAATCAGTAACTGATTTATCACTGGCTATGT 
GCTTAGATCTACAGGAGATCATATAATTTGATACAAATAAAAGAAAAGTGTTCTCTCCCC 
TTACAGAATTGACATTTTAAATGCGATACAGTTAGAATAGGAAATATGACATTAGAAAGG 
AAGAATGACAGGGAGAAAGGAAAGAAGGGAAAATGTTGCCAAGGAAAAAAAAA- 
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Figure 112 

MTFFLSLLLLLVCEAIWRSNSGSNTLENGYFLSRNKENHSQPTQSSLEDSVTPTKAVKTT 
GKG I VKGRNLDSRGL I LGAEAWGRGVKKNT 
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Figure 113 

TGAAGGACTTTTCCAGGACCCAAGGCCACACACTGGAAGTCTTGCAGCTGAAGGGAGGCA 
CTCCTTGGCCTCCGCAGCCGATCAC 

ATGAAGGTGGTGCCAAGTCTCCTGCTCTCCGTCCTCCTGGCACAGGTGTGGCTG 

GTACCCGGCTTGGCCCCCAGTCCTCAGTCGCCAGAGACCCCAGCCCCTCAGAACCAGACC 

AGCAGGGTAGTGCAGGCTCCCAGGGAGGAAGAGGAAGATGAGCAGGAGGCCAGCGAGGAG 

AAGGC CGGTGAGGAAGAGAAAGCCTGGCTGATGGC CAGCAGGCAGCAGCTTGCCAAGGAG 

ACTTCAAACTTCGGATTCAGCCTGCTGCGAAAGATCTCCATGAGGCACGATGGCAACATG 

GTCTTCTCTCCATTTGGCATGTCCTTGGCCATGACAGGCTTGATGCTGGGGGCCACAGGG 

CCGACTGAAACCCAGATCAAGAGAGGGCTCCACTTGCAGGCCCTGAAGCCCACCAAGCCC 

GGGCTCCTGCCTTCCCTCTTTAAGGGACTCAGAGAGACCCTCTCCCGCAACCTGGAACTG 

GGCCTCTCACAGGGGAGTTTTGCCTTCATCCACAAGGATTTTGATGTCAAAGAGACTTTC 

TTCAATTTATCCAAGAGGTATTTTGATACAGAGTGCGTGCCTATGAATTTTCGCAATGCC 

TCACAGGCCAAAAGGCTCATGAATCATTACATTAACAAAGAGACTCGGGGGAAAATTCCC 

AAACTGTTTGATGAGATTAATCCTGAAACCAAATTAATTCTTGTGGATTACATCTTGTTC 

AAAGGGAAATGGTTGACCCCATTTGACCCTGTCTTCACCGAAGTCGACACTTTCCACCTG 

GACAAGTACAAGACCATTAAGGTGCCCATGATGTACGGTGCAGGCAAGTTTGCCTCCACC 

TTTGACAAGAATTTTCGTTGTCATGTCCTCAAACTGCCCTACCAAGGAAATGCCACCATG 

CTGGTGGTCCTCATGGAGAAAATGGGTGACCACCTCGCCCTTGAAGACTACCTGACCACA 

GACTTGGTGGAGACATGGCTCAGAAACATGAAAACCAGAAACATGGAAGTTTTCTTTCCG 

AAGTTCAAGCTAGATCAGAAGTATGAGATGCATGAGCTGCTTAGGCAGATGGGAATCAGA 

AGAATCTTCTCACCCTTTGCTGACCTTAGTGAACTCTCAGCTACTGGAAGAAATCTCCAA 

GTATCCAGGGTTTTACGAAGAACAGTGATTGAAGTTGATGAAAGGGGCACTGAGGCAGTG 

GCAGGAATCTTGTCAGAAATTACTGCTTATTCCATGCCTCCTGTCATCAAAGTGGACCGG 

CCATTTCATTTCATGATCTATGAAGAAACCTCTGGAATGCTTCTGTTTCTGGGCAGGGTG 

GTGAATCCGACTCTCCTATAATTCAGGACATGCATAAGCACTTCGTGCTGTAGTAGATGC 

.TGAATCTGAGGTATCAAACACACACAGGATACCAGCAATGGATGGCAGGGGAGAGTGTTC 

CTTTTGTTCTTAACTAGTTT 

AGGGTGTTCTCAAATAAATACAGTAGTCCCCACTTATCTGAGGGGGATACATTCAAAGAC 
CCCCAGCAGATGCCTGAAACGGTGGACAGTGCTGAACCTTATATATATTTTTTCCTACAC 
ATACATACCTATGATAAAGTTTAATTTATAAATTAGGCACAGTAAGAGATTAACAATAAT 
AACAACATTAAGTAAAATGAGTTACTTGAACGCAAGCACTGCAATACCATAACAGTCAAA 
CTGATTATAGAGAAGGCTACTAAGTGACTCATGGGCGAGGAGCATAGACAGTGTGGAGAC 
ATTGGGCAAGGGGAGAATTCACATCCTGGGTGGGACAGAGCAGGACGATGCAAGATTCCA 
TCCCACTACTCAGAATGGCATGCTGCTTAAGACTTTTAGATTGTTTATTTCTGGAATTTT 
TCATTTAATGTTTTTGGACCATGGTTGACCATGGTTAACTGAGACTGCAGAAAGCAAAAC 
CATGGATAAGGGAGGACTACTACAAAAGCATTAAATTGATACATATTTTTTAAAAAAAAA 
AAAAAAAAAA 
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Figure 114 

MKWPSLLLSVLLAQVWLVPGLAPSPQSPETPAPQNQTSRWQAPREEEEDEQEASEEKA 
GEEEKAWLMASRQQLAKETSNFGFSLLRKISMRHDGNMVFSPFGMSLAMTGLMLGATGPT 
ETQIKRGLHLQALKPTKPGLLPSLFKGLRETLSRNLELGLSQGSFAFIHKDFDVKETFFN 
LSKRYFDTECVPMNFRNASQAKRLMNHYINKETRGKIPKLFDEINPETKLILVDYILFKG 
KWLTPFDPVFTEVDTFHLDKYKTIKVPMMYGAGKFASTFDKNFRCHVLKLPYQGNATMLV 
VLMEKMGDHI^LEDYLTTDLVETWLRNMKTRNMEVFFPKFKLDQKYEMHELLRQMGIRRI 
FSPFADLSELSATGRNLQVSRVLRRTVIEVDERGTEAVAGILSEITAYSMPPVIKVDRPF 
HFMIYEETSGMLLFLGRWNPTLL 
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Figure 115 

CGGCAACCAGCCGCCGCCACCACCGCTGCCACTGCCGCCCTGCCGGGGCC 

ATGTTCGCTCTGGGCTTGCCCTTCTTGGTGCTCTTGGTGGCCTCGGTCGAGAGCCATCTG 

GGGGTTCTGGGGCCCAAGAACGTCTCGCAGAAAGACGCCGAGTTTGAGCGCACCTACGTG 

GACGAGGTCAACAGCGAGCTGGTCAACATCTACACCTTCAACCATACTGTGACCCGCAAC 

AGGACAGAGGGCGTGCGTGTGTCTGTGAACGTCCTGAACAAGCAGAAGGGGGCGCCGTTG 

CTGTTTGTGGTCCGCCAGAAGGAGGCTGTGGTGTCCTTCCAGGTGCCCCTAATCCTGCGA 

GGGATGTTTCAGCGCAAGTACCTCTACCAAAAAGTGGAACGAACCCTGTGTCAGCCCCCC 

ACCAAGAATGAGTCGGAGATTCAGTTCTTCTACGTGGATGTGTCCACCCTGTCACCAGTC 

AACACCACATACCAGCTCCGGGTCAGCCGCATGGACGATTTTGTGCTCAGGACTGGGGAG 

CAGTTCAGCTTCAATACCACAGCAGCACAGCCCCAGTACTTCAAGTATGAGTTCCCTGAA 

GGCGTGGACTCGGTAATTGTCAAGGTGACCTCCAACAAGGCCTTCCCCTGCTCAGTCATC 

TCCATTCAGGATGTGCTGTGTCCTGTCTATGACCTGGACAACAACGTAGCCTTCATCGGC 

ATGTACCAGACGATGACCAAGAAGGCGGCCATCACCGTACAGCGCAAAGACTTCCCCAGC 

AACAGCTTTTATGTGGTGGTGGTGGTGAAGACCGAAGACCAAGCCTGCGGGGGCTCCCTG 

CCTTTCTACCCCTTCGCAGAAGATGAACCGGTCGATCAAGGGCACCGCCAGAAAACCCTG 

TCAGTGCTGGTGTCTCAAGCAGTCACGTCTGAGGC 

ATACGTCAGTGGGATGCTCTTTTGCCTGGGTATATTTCTCTCCTTTTACCTGCTGACCGT 
CCTCCTGGCCTGCTGGGAGAACTGGAGGCAGAAGAAGAAGACCCTGCTGGTGGCCATTGA 
CCGAGCCTGCCCAGAAAGCGGTCACCCTCGAGTCCTGGCTGATTCTTTTCCTGGCAGTTC 
CCCTTATGAGGGTTACAACTATGGCTCCTTTGAGAATGTTTCTGGATCTACCGATGGTCT 
GGTTGACAGCGCTGGCACTGGGGACCTCTCTTACGGTTACCAGGGCCGCTCCTTTGAACC 
TGTAGGTACTCGGCCCCGAGTGGACTCCATGAGCTCTGTGGAGGAGGATGACTACGACAC 
ATTGACCGACATCGATTCCGACAAGAATGTCATTCGCACCAAGCAATACCTCTATGTGGC 
TGACCTGGCACGGAAGGACAAGCGTGTTCTGCGGAAAAAGTACCAGATCTACTTCTGGAA 
CATTGCCACCATTGCTGTCTTCTATGCCCTTCCTGTGGTGCAGCTGGTGATCACCTACCA 
GACGGTGGTGAATGTCACAGGGAATCAGGACATCTGCTACTACAACTTCCTCTGCGCCCA 
CCCACTGGGCAATCTCAGCGCCTTCAACAACATCCTCAGCAACCTGGGGTACATCCTGCT 
GGGGCTGCTTTTCCTGCTCATCATCCTGCAACGGGAGATCAACCACAACCGGGCCCTGCT 
GCGCAATGACCTCTGTGCCCTGGAATGTGGGATCCCCAAACACTTTGGGCTTTTCTACGC 
CATGGGCACAGCCCTGATGATGGAGGGGCTGCTCAGTGCTTGCTATCATGTGTGCCCCAA 
CTATACCAATTTCCAGTTTGACACATCGTTCATGTACATGATCGCCGGACTCTGCATGCT 
GAAGCTCTACCAGAAGCGGCACCCGGACATCAACGCCAGCGCCTACAGTGCCTACGCCTG 
CCTGGCCATTGTCATCTTCTTCTCTGTGCTGGGCGTGGTCTTTGG 

CAAAGGGAACACGGCGTTCTGGATCGTCTTCTCCATCATTCACATCATCGCCACCCTGCT 

CCTCAGCACGCAGCTCTATTACATGGGCCGGTGGAAACTGGACTCGGGGATCTTCCGCCG 

CATCCTCCACGTGCTCTACACAGACTGCATCCGGCAGTGCAGCGGGCCGCTCTACGTGGA 

CCGCATGGTGCTGCTGGTCATGGGCAACGTCATCAACTGGTCGCTGGCTGCCTATGGGCT 

TATCATGCGCCCCAATGATTTCGCTTCCTACTTGTTGGCCATTGGCATCTGCAACCTGCT 

CCTTTACn^CGCCTTCTACATCATCATGAAGCTCCGGAGTGGGGAGAGGATCAAGCTCAT 

CCCCCTGCTCTGCATCGTTTGCACCTCCGTGGTCTGGGGCTTCGCGCTCTTCTTCTTCTT 

CCAGGGACTCAGCACCTGGCAGAAAACCCCTGCAGAGTCGAGGGAGCACAACCGGGACTG 

CATCCTCCTCGACTTCTTTGACGACCACGACATCTGGCACTTCCTCTCCTCCATCGCCAT 

GTTCGGGTCCTTCCTGGTGTTGCTGACACTGGATGACGACCTGGATACTGTGCAG 

CGGGACAAGATCTATGTCTTCTAGCAGGAGCTGGGCCCTTCGCTTCACCTCAAGGGGCCC 

TGAGCTCCTTTGTGTCATAGACCGGTCACTCTGTCGTGCTGTGGGGATGAGTCCCAGCAC 

CGCTGCCCAGCACTGGATGGCAGCAGGACAGCCAGGTCTAGCTTAGGCTTGGCCTGGGAC 

AGCCATGGGGTGGCATGGAACCTTGCAGCTGCCCTCTGCCGAGGAGCAGGCCTGCTCCCC 

TGGAACCCCCZAGATGTTGGCCAAATTGCTGCTTTCTTCTCAGTGTTGGGGCCTTCCATGG 

GCCCCTGTCCTTTGGCTCTCCATTTGTCCCTTTGCAAGAGGAAGGATGGAAGGGACACCC 

TCCCCATTTCATGCCTTGCATTTTGCCCGTCCTCCTCCCCACAATGCCCCAGCCTGGGAC 

CTAAGGCCTCTTTTTCCTCCCATACTCCCACTCCAGGGCCTAGTCTGGGGCCTGAATCTC 

TGTCCTGTATCAGGGCCCCAGTTCTCTTTGGGCTGTCCCTGGCTGCCATCACTGCCCATT 

CCAGTCAGCCAGGATGGATGGGGGTATGAGATTTTGGGGGTTGGCCAGCTGGTGCCAGAC 

TTTTGGTGCTAAGGCCTGCAAGGGGCCTGGGGCAGTGCGTATTCTCTTCCCTCTGACCTG 

TGCTCAGGGCTGGCTCTTTAGCAATGCGCTCAGCCCAATTTGAGAACCGCCTTCTGATTC 
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AAGAGGCTGAATTCAGAGGTCACCTCTTCATCCCATCAGCTCCCAGACTGATGCCAGCAC 
CAGGACTGGAGGGAGAAGCGCCTCACCCCTTCCCTTCCTTCTTTCCAGGCCCTTAGTCTT 
GCCAAACCCCAGCTGGTGGCCTTTCAGTGCCATTGACACTGCCCAAGAATGTCCAGGGGC 
AAAGGAGGGATGATACAGAGTTCAGC CCGTTCTG 

CCTCCACAGCTGTGGGCACCCCAGTGCCTACCTTAGAAAGGGGCTTCAGGAAGGGATGTG 
CTGTTTCCCTCTACGTGCCCAGTCCTAGCCTCGCTCTAGGACCCAGGGCTGGCTTCTAAG 
TTTCCGTCCAGTCTTCAGGCAAGTTCTGTGTTAGTCATGCACACACATACCTATGAAACC 
TTGGAGTTTACAAAGAATTGCCCCAGCTCTGGGCACCCTGGCCACCCTGGTCCTTGGATC 
CCCTTCGTCCCACCTGGTCCACCCCAGATGCTGAGGATGGGGGAGCTCAGGCGGGGCCTC 
TGCTTTGGGGATGGGAATGTGTTTTTCTCCCAAACTTGTTTTTATAGCTCTGCTTGAAGG 
GCTGGGAGATGAGGTGGGTCTGGATCTTTTCTCAGAGCGTCTCCATGCTATGGTTGCATT 
TCCGTTTTCTATGAATGAATTTGCATTCAATAAACAACCAGACTCAAAAAAAAAAAAAAA 
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Figure 116 

MFALGLPFLVLLVASVESHLGVLGPKNVSQ 

RTEGVRVSVNVLNKQKGAPLLFWRQKEAWSFQVPLILRGMFQRKYLYQKVERTLCQPP 
TKNESEIQFFYVDVSTLSPVNTTYQLRVSRMDDFVLRTGEQFSFNTTAAQPQYFKYEFPE 
GVDSVIVKVTSNKAFPCSVISIQDVLCPVYDLDNNVAFIGM^ 

NSFYVVVVVKTEDQACGGSLPFYPFAEDEPVDQGHRQKTLSVLVSQAVTSEAYVSGMLFC 
LGIFLSFYLLTVLLACWENWRQKKKTLLVAIDRACPESGHPRVIiADSFPGSSPYEGYNYG 
S FENVSGSTDGL VDSAGTGDLS YGYQGRS FE P VGTRPRVDSMS S VEEDDYDTLTD I DSDK 
NVI RTKQ YL YVADLARKDKRVLRKKYQ I YFWNI ATI AVFYAL P WQLVI TYQTWNVTGN 
QDICYYNFLCAHPLGNLSAFNNILSNLGYILLGLLFLLIILQREINHNRALLRNDLCALE 
CGIPKHFGLFYAMGTALMMEGLLSACYHVCPNYTNFQFDTSFMYMIAGLCMLKLYQKRHP 
DINASAYSAYACLAI VI FFSVLGWFGKGNTAFWI VFS 1 1 HI I ATLLLSTQLYYMGRWKL 
DSGIFRRILHVLYTDCIRQCSGPLYVDRMVLLVMGNVINWSLAAYGLIMRPNDFASYLLA 
IGICNLLLYFAFYIIMKLRSGERIKLIPLLCIVCTSWWGFALFFFFQGLSTWQKTPAES 
REHNRDCILLDFFDDHDIWHFLSSIAMFGSFLVLLTLDDDLDTVQRDKIYVF 
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Figure 117 

GACTTTGCTTGAATGTTTACATTTTCTGCTCGCTGTCCTACATATCACAATATAGTGTTC 

ACGTTTTGTTAAAACTTTGGGGTGTCAGGAGTTGAGCTTGCTCAGCAAGCCAGC 

ATGGCTAGGATGAGCTTTGTTATAGCAGCTTGCCAATTGGTGCTGGGCCTACTAATGACT 

TCATTAAC CGAGTCTTC CATACAGAATAGTGAGTGTC CACAACTTTG CGTATGTGAAATT 

CGTCCCTGG7TTACCCCACAGTCAACTTACAGAGAAGCCACCACTGTTGATTGCAATGAC 

CTCCGCTTAACAAGGATTCCCAGTAACCTCTCTAGTGACACACAAGTGCTTCTCTTACAG 

AGCAATAACATCGCGAAGACTGTGGATGAGCTGCAGCAGCTTTTCAACTTGACTGAACTA 

GATTTCTCCCAAAACAACTTTACTAACATTAAGGAGGTCGGGCTGGCAAACCTAACCCAG 

CTCACAACGCTGCATTTGGAGGAAAATCAGATTACCGAGATGACTGATTACTGTCTACAA 

GACCTCAGCAACCTTCAAGAACTCTACATCAACCACAACCAAATTAGCACTATTTCTGCT 

CATGCTTTTGCAGGCTTAAAAAATCTATTAAGGCTCCACCTGAACTCCAACAAATTGAAA 

GTTATTGATAGTCGCTGGTTTGATTCTACACCCAACCTGGAAATTCTCATGATCGGAGAA 

AACCCTGTGATTGGAATTCTGGATATGAACTTCAAACCCCTCGCAAATTTGAGAAGCTTA 

GTTTTGGCAGGAATGTATCTCACTGATATTCCTGGAAATGCTTTGGTGGGTCTGGATAGC 

CTTGAGAGCCTGTCTTTTTATGATAACAAACTGGTTAAAGTCCCTCAACTTGCCCTGCAA 

AAAGTTCCAAATTTGAAATTCTTAGACCTCAACAAAAACCCCATTCACAAAATCCAAGAA 

GGGGACTTCAAAAATATGCTTCGGTTAAAAGAACTGGGAATCAACAATATGGGCGAGCTC 

GTTTCTGTCGACCGCTATGCCCTGGATAACTTGCCTGAACTCACAAAGCTGGAAGCCACC 

AATAACCCTAAACTCTCTTACATCCACCGCTTGGCTTTCCGAAGTGTCCCTGCTCTGGAA 

AGCTTGATGCTGAACAACAATGCCTTGAATGCCATTTACCAAAAGACAGTCGAATCCCTC 

CCCAATCTGCGTGAGATCAGTATCCATAGCAATCCCCTCAGGTGTGACTGTGTGATCCAC 

TGGATTAACTCCAACAAAACCAACATCCGCTTCATGGAGCCCCTGTCCATGTTCTGTGCC 

ATGCCGCCCGAATATAAAGGGCACCAGGTGAAGGAAGTTTTAATCCAGGATTCGAGTGAA 

CAGTGCCTCCCAATGATATCTCACGACAGCTTCCCAAATCGTTTAAACGTGGATATCGGC 

ACGACGGTTTTCCTAGACTGTCGAGCCATGGCTGAGCCAGAACCTGAAATTTACTGGGTC 

ACTCCCATTGGAAATAAGATAACTGTGGAAAC CCTTTCAGATAAATACAAG CTAAGTAG C 

GAAGGTACCTTGGAAATATCTAACATACAAATTGAAGACTCAGGAAGATACACATGTGTT 

GCCCAGAATGTCCAAGGGGCAGACACTCGGGTGGCAACAATTAAGGTTAACGGGACCCTT 

CTGGATGGTACCCAGGTGCTAAAAATATACGTCAAGCAGACAGAATCCCATTCCATCTTA 

GTGTCCTGGAAAGTTAATTCCAATGTCATGACGTCAAACTTAAAATGGTCGTCTGCCACC 

ATGAAGATTGATAACCCTCACATAACATATACTGCCAGGGTCCCAGTCGATGTCCATGAA 

TACAACCTAACGCATCTGCAGCCTTCCACAGATTATGAAGTGTGTCTCACAGTGTCCAAT 

ATTCATCAGCAGACTCAAAAGTCATGCGTAAATGTCACAACCAAAAATGCCGCCTTCGCA 

GTGGACATCTCTGATCAAGAAACCAGTACAGCCCTTGCTGCAGTAATGGGGTCTATGTTT 

GCCGTCATTAGCCTTGCGTCCATTGCTGTGTACTTTGCCAAAAGATTTAAGAGAAAAAAC 

TACCACCACTCATTAAAAAAGTATATGCAAAAAACCTCTTCAATCCCACTAAATGAGCTG 

TACCCACCACTCATTAACCTCTGGGAAGGTGACAGCGAGAAAGACAAAGATGGTTCTGCA 

GACACCAAGCCAACCCAGGTCGACACATCCAGAAGCTATTACATGTGGTAACTCAGAGGA 

TATTTTGCTTCTGGTAGTAAGGAGCACAAAGACGTTTTTGCTTTATTCTGCAAAAGTGAA 

CAAGTTGAAGACTTTTGTATTTTTGACTTTGCTAGTTTGTGGCAGAGTGGAGAGGACGGG 

TGGATATTTCAAATTTTTTTAGTATAGCGTATCGCAAGGGTTTGACACGGCTGCCAGCGA 

CTCTAGGCTTCCAGTCTGTGTTTGGTTTTTATTCTTATCATTATTATGATTGTTATTATA 

TTATTATTTTATTTTAGTTGTTGTGCTAAACTCAATAATGCTGTTCTAACTACAGTGCTC 

AATAAAATGATTAATGAC^GGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Figure 118 

MARMSFVIAACQLVIiGLLMTSLTESSIQNSECPQLCVCEIRPWFTPQSTYREATTVDCT 
DLRLTRIPS^SSDTQVLI^QSNNIAKTVDELQQLFNLTELDFSQNNFTNIKEVGLANL 
TQLTTLHLEENQITEMTDYCLQDLSNLQELYINHNQISTISAHAFAGLKNLLRLHLNSN 
KLKVIDSRWFDSTPNLEILMIGENPVIGILDMNFKPLANLRSLVLAGMYLTDIPGNALV 
GLDSLESLSFYDNKLVKVPQLALQKVPNLKFLDLNKNPIHKIQEGDFKNMLRLKELGIN 
NMGELVSVDRYALDNLPELTKLEATNN^ 

KTVESLPNLREISIHSNPLRCDCVIHWINSNKTNIRFMEPLSMFCAMPPEYKGHQVKEV 
LIQDSSEQCLPMISHDSFPNRLNVDIGTTVFLOCRAMAEPEPEIYWVTPIGNKITVETL 
SDKYKLSSEGTLEISNIQIEDSGRYTCVAQNV^GADTRVATIKVNGTLLDGTQVLKIYV 
KQTES HS I LVS WKVNSNVMTSNLKWSSATMKIDNPHI TYTARVPVDVHEYNLTHLQPST 
DYEVCLTVSNIHQQTQKSCVNVTTKNAAFAVDISDQETSTALAAVMGSMFAVISLASIA 
VYFAKRFKRKNYHHSLKKYMQKTSSIPLNELYPPLINLWEGDSEKDKDGSADTKPTQVD 
TSRSYYMW 
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Figure 1 19 

CCCACGCGTCCGCCCACGCGTCCGGGTGCCAGTCGCGCGCCGGCCGCGCTCCGGGCTTCT 
CTTTTCCCTCCGACGCGCCACGGCTGCCCAGACATTCCGGCTGCCGGGTCTGGAGAGCTC 
CCCGAACCCCTCCGCGGAGAGGAGCGAGGCGGCGCCAGGGTGGCCCCCGGGGCGCGCTTG 
GTCTCGGAGAAGCGGGGACGAGGCCGGAGGATGAGCGACTGAGGGCGACGCGGGCACTGA 
CGCGAGTTGGGGCCGCGACTACCGGCAGCTGACAGCGCGATGAGCGACTCCCCAGAGACG 
CCCTAGCCCGGTGTGCGCGCCAGGCGGAGCGCGCAGGTGGGGCTGGGCTGTTAGTGGTCC 
GCCCCACGCGGGTCGCCGGCCGGCCCAGGATGGGCGCTGGCAACCCGGGCCCGCGCCCGC 
CGCTGCTACCCCTGCGCCCGCTGCGAGCCCGGCGTCCGGCCCGCGCCCTGCGCTCATGGA 
CGGCGGCTCCCGGCTGGCGGCGGCGCGCCCCCGGGCTGTGAATGCGACTCGCCCGTCGGC 
CGCGCTCCCCGCCCGCCCGCCCGCCGGGACGTGGTAGGGG 

ATGCCCAGCTCCACTGCGATGGCAGTTGGCGCGCTCTCCAGTTCCCTCCTGGTCACCTGC 
TGCCTGATGGTGGCTCTGTGCAGTCCGAGCATCCCGCTGGAGAAGCTGGCCCAGGCACCA 
GAGCAGCCGGGCCAGGAGAAGCGTGAGCACGCCACTCGGGACGGCCCGGGGCGGGTGAAC 
GAGCTCGGGCGCCCGGCGAGGGACGAGGGCGGCAoCGGCCGGGACTGGAAGAGCAAGAGC 
GGCCGTGGGCTCGCCGGCCGTGAGCCGTGGAGCAAGCTGAAGCAGGCCTGGGTCTCCCAG 
GGCGGGGGCGCCAAGGCCGGGGATCTGCAGGTCCGGCCCCGCGGGGACACCCCGCAGGCG 
GAAGCCCTGGCCGCAGCCGCCCAGGACGCGATTGGCCCGGAACTCGCGCCCACGCCCGAG 
CCACCCGAGGAGTACGTGTACCCGGACTACCGTGGCAAGGGCTGCGTGGACGAGAGCGGC 
TTCGTGTACGCGATCGGGGAGAAGTTCGCGCCGGGCCCCTCGGCCTGCCCGTGCCTGTGC 
ACCGAGGAGGGGCCGCTGTGCGCGCAGCCCGAGTGCCCGAGGCTGCACCCGCGCTGCATC 
CACGTCGACACGAGCCAGTGCTGCCCGCAGTGCAAGGAGAGGAAGAACTACTGCGAGTTC 
CGGGGCAAGACCTATCAGACTTTGGAGGAGTTCGTGGTGTCTCCATGCGAGAGGTGTCGC 
TGTGAAGCCAACGGTGAGGTG 

CTATGCACAGTGTCAGCGTGTCCCCAGACGGAGTGTGTGGACCCTGTGTACGAGCCTGAT 
CAGTGCTGTCCCATCTGCAAAAATGGTCCAAACTGCTTTGCAGAAACCGCGGTGATCCCT 
GCTGGCAGAGAAGTGAAGACTGACGAGTGCACCATATGCCACTGTACTTATGAGGAAGGC 
ACATGGAGAATCGAGCGGCAGGCCATGTGCACGAGACATGAATGCAGGCAAATGTAGACG 
CTTCCCAGAACACAAACTCTGACTTTTTCTAGAACATTTTACTGATGTGAACATTCTAGA 
TGACTCTGGGAACTATCAGTCAAAGAAGACTTTTGATGAGGAATAATGGAAAATTGTTGG 
TACTTTTCCTTTTCTTGATAACAGTTACTACAACAGAAGGAAATGGATATATTTCAAAAC 
ATCAACAAGAACT1TGGGCATAAAATCCTTCTCTAAATAAATGTGCTATTTTCACAGTAA 
GTACACAAAAGTACACTATTATATATCAAATGTATTTCTATAATCCCTCCATTAGAGAGC 
TTATATAAGTGTTTTCTATAGATGCAGATTAAAAaTGCTGTGTTGTCAACCGTCAAAAAA 
AAAAAAAAAAAAAAAAAAAAA 
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Figure 120 

MPSSTAMAVGALSSSLLVTCCLMVALCS PS I PLEKLAQAPEQPGQEKREHATRDGPGRVN 
ELGRPARDEGGSGRDWKSKSGRGLAGREPWSKLKQAWVSQGGGAKAGDLQVRPRGDTPQA 
EALAAAAQDAI GPELAPTPEP P EE YVYPD YRGKGCVDESGFVYAI GEKFAPGPS ACPCLC 
TEEGPLCAQPECPRLHPRCIHVDTSQCCPQCKERKNYCEFRGKTYQTLEEFWSPCERCR 
CEANGEVLCTVSACPQTECVDPVYEPDQCCPICKNGPNCFAETAVIPAGREVKTDECTIC 
HCTYEEGTWRI ERQAMCTRHECRQM 
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Figure 121 

CGCGGAGCCCTGCGCTGGGAGGTGCACGGTGTGCACGCTGGACTGGACCCCCATGCAACC 
CCGCGCCCTGCGCCTTAACCAGGACTGCTCCGCGCGCCCCTGAGCCTCGGGCTCCGGCCC 
GGACCTGCAGCCTCCCAGGTGGCTGGGAAGAACTCTCCAACAATAAATACATTTGATAAG 
AAAG 

ATGG CTTTAAAAGTGCTACTAGAACAAGAGAAAACGTTTTTCACTCTTTTAGTATTACTA 
GGCTATTTGTCATGTAAAGTGACTTGTGAATCAGGAGACTGTAGACAGCAAGAATTCAGG 
GATCGGTCTGGAAACTGTGTTCCCTGCAACCAGTGTGGGCCAGGCATGGAGTTGTCTAAG 
GAATGTGGCTTCGGCTATGGGGAGGATGCACAGTGTGTGACGTGCCGGCTGCACAGGTTC 
AAGGAGGACTGGGGCTTCCAGAAATGCAAGCCCTGTCTGGACTGCGCAGTGGTGAACCGC 
TTTCAGAAGGCAAATTGTTCAGCCACCAGTGATGCCATCTGCGGGGACTGCTTGCCAGGA 
TTTTATAGGAAGACGAAACTTGTCGGCTTTCAAGACATGGAGTGTGTGCCTTGTGGAGAC 
CCTCCTCCTCCTTACGAACCGCACTGTGCCAGCAAGGTCAACCTCGTGAAGATCGCGTCC 
ACGGCCTCCAGCCCACGGGA 

CACGGCGCTGGCTGCCGTTATCTGCAGCGCTCTGGCCACCGTCCTGCTGGCCCTGCTCAT 
CCTCTGTGTCATCTATTGTAAGAGACAGTTTATGGAGAAGAAACCCAGCTGGTCTCTGCG 
GTCGCAGGACATTCAGTACAACGGCTCTGAGCTGTCGTGTTTTGACAGACCTCAGCTCCA 
CGAATATGCCCACAGAGCCTGCTGCCAGTGCCGCCGTGACTCAGTGCAGACCTGCGGGCC 
GGTGCGCTTGCTCCCATCCATGTGCTGTGAGGAGGCCTGCAGCCCCAACCCGGCGACTCT 
TGGTTGTGGGGTGCATTCTGCAGCCAGTCTTCAGGCAAGAAACGCAGGCCCAGCCGGGGA 
GATGGTGCCGACTTTCTTCGGATCCCTCACGCAGTCCATCTGTGGCGAGTTTTCAGATGC 
CTGGCCTCTGATGCAGAATCCCATGGGTGGTGACAACATCTCTTTTTGTGACTCTTATCC 
TGAACTCACTGGAGAAGACATTCATTCTCTCAATCCAGAACTTGAAAGCTCAACGTCTTT 
GGATTCAAATAGCAGTCAAGATTTGGTTGGTGGGGCTGTTCCAGTCCAGTCTCATTCTGA 
AAA.CTTTACAGCAGCTACTGATTTATCTAGATATAACAACACACTGGTAGAATCAGCATC 
AACTCAGGATGCACTAACTATGAGAAGCCAGCTAGATCAGGAGAGTGGCGCTGTCATCCA 
CCCAGCCACTCAGACGTCCCTCCAGGAAGCTTAAAGAACCTGCTTCTTTCTGCAGTAGAA 
GCGTGTGCTGGAACCCAAAGAGTACTCCTTTGTTAGGCTTATGGACTGAGCAGTCTGGAC 
CTTGCATGGCTTCTGGGGCAAAAATAAATCTGAACCAAACTGACGGCATTTGAAGCCTTT 
C^GCCAGTTGCTTCTGAGCCAGACCAGCTGTAAGCTGAAACCTCAATGAATAACAAGAAA 
AGACTCCAGGCCGACTCATGATACTCTGCATCTTTCCTACATGAGAAGCTTCTCTGCCAC 
AAAAGTGACTTCAAAGACTGATGGGTTGAGCTGGCAGCCTATGAGATTGTGGACATATAA 
CAAGAAACAGAAATGCCCTCATGCTTATTTTCATGGTGATTGTGGTTTTACAAGACTGAA 
GACCCAGAGTATACTTTTTCTTTCCAGAAATAATTTCATACCGCCTATGAAATATCAGAT 
AAATTACCTTAGCTTTTATGTAGAATGGGTTCAAAAGTGAGTGTTTCTATTTGAGAAGGA 
CACTTTTTCATCATCTAAACTGATTCGCATAGGTGGTTAGAATGGCCCTCATATTGCCTG 
CCTAAATCTTGGGTTTATTAGATGAAGTTTACTGAATCAGAGGAATCAGACAGAGGAGGA 
TAGCTCTTTCCAGAATCCACACTTCTGACCTCAGCCTCGGTCTCATGAACACCCGCTGAT 
CTCAGGAGAACACCTGGGCTAGGGAATGTGGTCGAGAAAGGGCAGCCCATTGCCCAGAAT 
TAACACATATTGTAGAGACTTGTATGCAAAGGTTGGCATATTTATATGAAAATTAGTTGC 
TATAGAAAC^TTTGTTGCATCTGTCCCTCTGCCTGAGCTTAGAAGGTTATAGAAAAAGGG 
TATTTATAAACATAAATGACCTTTTACTTGCATTGTATCTTATACTAAAGGCTTTAGA^ 
TTACAACATATCAGGTTCCCCTACTACTGAAGTAGCCTTCCGTGAGAACACACCACATGT 
TAGGACTAGAAGAAAATGCACAATTTGTAGGGGTTTGGATGAAGCAGCTGTAACTGCCCT 
AGTGTAGTTTGACCAGGACATTGTCGTGCTCCTTCCAATTGTGTAAGATTAGTTAGCACA 
TCATCTCCTACTTTAGCCATCCGGTGTTGGATTTAAGAGGACGGTGCTTCTTTCTATTAA 
AGTGCTCCATCCCCTACCATCTACACATTAGCATTGTCTCTAGAGCTAAGACAGAAATTA 
ACCCCGTTCAGTCACAAAGCAGGGAATGGTTCATTTACTCTTAATCTTTATGCCCTGGAG 
AAGACCTACTTGAACAGGGCATATTTTTTAGACTTCTGAACATCAGTATGTTCGAGGGTA 
CTATGATATTTTGGTTTGGAATTGCCCTGCCCAAGTCACTGTCTTTTAACTTTTAAACTG 
AATATTAAAATGTATCTGTCTTTCCT 
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Figure 122 

MALICVLLEQEKTFFTLLVLLGYLSCKVTCESGDCRQQEFRDRSGNCVPCNQCGPGMELSK 
ECGFGYGEDAQCVTCRLHRFKEDWGFQKCKPCLDCAWNRFQKANCSATSDAICGDCLPG 
FYRKTKLVGFQDMECVP CGDP P P P YE PHCAS KVNLVKI ASTAS S P RDTALAAV I CSALAT 
VLIJUiLILCVIYCKRQFMEKKPSWSLRSQDIQYNGSELSCFDRPQLHEYAHRACCQCRRD 
SVQTCGPVRLLPSMCCEEACSPNPATLGCGVHSAASLQARNAGPAGEMVPTFFGSLTQSI 
CGEFSDAWPLMQNPMGGDNISFCDSYPELTGEDIHSLNPELESSTSLDSNSSQDLVGGAV 
PVQSHSENFTAATDLSRYNNTLVESASTQDALTMRSQLDQESGAVIHPATQTSLQEA 
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Figure 123 

GGGTGATTGAACTAAACCTTCGCCGCACCGAGTTTGCAGTACGGCCGTCACCCGCACCGC 

TGCCTGCTTGCGGTTGGAGAAATCAAGGCCCTACCGGGCCTCCGTAGTCACCTCrCTATA 

GTGGGCGTGGCCGAGGCCGGGGTGACCCTGCCGGAGCCTCCGCTGCCAGCGAC 

ATGTTCAAGGTAATTCAGAGGTCCGTGGGGCCAGCCAGCCTGAGCTTGCTCACCTTCAAA 

GTCTATGCAGCACCAAAAAAGGACTCACCTCCCAAAAATTCCGTGAAGGTTGATGAGCTT 

TCACTCTACTCAGTTCCTGAGGGTCAATCGAAGTATGTGGAGGAGGCAAGGAGCCAGCTT 

GAAGAAAGCATCTCACAGCTCCGACACTATTGCGAGCCATACACAACCTGGTGTCAGGAA 

ACGTACTCCCAAACTAAGCCCAAGATGCAAAGTTTGGTTCAATGGGGGTTAGACAGCTAT 

GACTATCTCCAAAATGCACCTCCTGGATTTTTTCCGAGACTTGGTGTTATTGGTTTTGCT 

GGCCTTATTGGACTCCTTTTGGCTAGAGGTTCAAAAATAAAGAAGCTAGTGTATCCGCCT 

GGTTTCATGGGATTAGCTGCCTCCCTCTATTATCCACAACAAGCCATCGTGTTTGCCCAG 

GTCAGTGGGGAGAGATTATATGACTGGGGTTTACGAGGATATATAGTCATAGAAGATTTG 

TGGAAGGAGAACTTTCAAAAGCCAGGAAATGTGAAGAATTCACCTGGAACTAAGTAGAAA 

ACTCCATGCTCTGCCATCTTAATCAGTTATAGGTAAACATTGGAAACTCCATAGAATAAA 

TCAGTATTTCTACAGAAAAATGGCATAGAAGTCAGTATTGAATGTATTAAATTGGCTTTC 

TTCTTCAGGAAAAACTAGACCAGACCTCTGTTATCTTCTGTGAAATCATCCTACAAGCAA 

ACTAACCTGGAATCCCTTCACCTAGAGATAATGTACAAGCCTTAGAACTCCTCATTCTCA 

TGTTGCTATTTATGTACCTAATTAAAACCCAAGTTTAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAA 
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Figure 124 

MFKVIQRSVGPASLSLLTFKVYAAPKKDSPPKNSVKVDELSLYSVPEGQSKYVEEARSQL 
EESISQLRHYCEPYTTWCQETYSQTKPKMQSLVQWGLDSYDYLQNAPPGFFPRLGVIGFA 
GLIGLLLARGSKIKKLVYPPGFMGLAASLYYPQQAIVFAQVSGERLYDWGLRGYIVIEDL 
WKENFQKPGNVKNS PGTK 
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Figure 125 

TTGAAAATCTACTCTATCAGCTGCTGTGGTTGCCACCATTCTCAGGACCCTCGCC 

ATGA AAGCCCTTATGCTGCTCACCCTGTCTGTTCTGCTCTGCTGGGTCTCAGCT 

GACATTCGCTGTCACTCCTGCTACAAGGTCCCTGTGCTGGGCTGTGTGGACCGGCAGTCC 

TGCCGCCTGGAGCCAGGACAGCAATGCCTGACAACACATGCATACCTTGGTAAGATGTGG 

GTTTTCTCCAATCTGCGCTGTGGCACACCAGAAGAGCCCTGTCAGGAGGCCTTCAACCAA 

ACCAACCGCAAGCTGGGTCTGACATATAACACCACCTGCTGCAACAAGGACAACTGCAAC 

AGCGCAGGACCCCGGCCCACTCCAGCCCTGGGCCTTGTCTTCCTTACCTCCTTGGCTGGC 

CTTGGCCTCTGGCTGCTGCAC TGA GACTCATTCCATTGGCTGCCCCTCCTCCCACCTGCC 

TTGGCCTGAGCCTCTCTCCCTGTGTCTCTGTATCZeCTGGCTTTACAGAATCGTCTCTCC 

CTAGCTCCCATTTCTTTAATTAAACACTGTTCCGAGTGGTCTCCTCATCCATCCTTCCCA 

CCTCACACCCTTCACTCTCCTTTTTCTGGGTCCCTTCCCACTTCCTTCCAGGACCTCCAT 

TGGCTCCTAGAAGGGCTCCCCACTTTGCTTCCTATACTCTGCTGTCCCCTACTTGAGGAG 

GGATTGGGATCTGGGCCTGAAATGGGGCTTCTGTGTTGTCCCCAGTGAAGGCTCCCACAA 

GGACCTGATGACCTCACTGTACAGAGCTGACTCCCCAAACCCAGGCTCCCATATGTACCC 

CATCCCCCATACTCACCTCTTTCCATTTTGAGTAATAAATGTCTGAGTCTGGAAAAAAAA 

AAAAAAAAAA 
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Figure 126 
MKALMLLTLSVLLCWVSADIRCHSCYKVPVLGCVDRQ 

SNIJiCGTPEEPCQEAFNQTNRKLGLTYNTTCCNKDNCNSAGPRPTPAIiGLVFLTSLAGLG 
LWLLH 
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Figure 127 

ACC 

ATGGATTG CCAAGAAAATGAGTACTGGGAC CAATGGGGACGGTGTGTCACCTGCCAACGG 
TGTGGTCCTGGACAGGAGCTATCCAAGGATTGTGGTTATGGAGAGGGTGGAGATGCCTAC 
TGCACAGCCTGCCCTCCTCGCAGGTACAAAAGCAGCTGGGGCCACCACAGATGTCAGAGT 
TGCATCACCTGTGCTGTCATCAATCGTGTTCAGAAGGTCAACTGCACAGCTACCTCTAAT 
GCTGTCTGTGGGGACTGTTTGCCCAGGTTCTACCGAAAGACACGCATTGGAGGCCTGCAG 
GACCAAGAGTGCATCCCGTGCACGAAGCAGACCCCCACCTCTGAGGTTCAATGTGCCTTC 
CAGTTGAGCTTAGTGGAGGCAGATGCACCCACAGTGCCCCCTCAGGAGGCCACACTTGTT 
GCACTGGTGAGCAGCCTGCTAGTGGTGTTTACCCTGGCCTTCCTGGGGCTCTTCTTCCTC 
TACTGCAAGCAGTTCTTO\ACAGACATTGCCAGCGTGTTACAGGAGGTTTGCTGCAGTTT 
GAGGCTGATAAAACAGCAAAGGAGGAATCTCTCTTCCCCGTGCCACCCAGCAAGGAGACC 
AGTGCTGAGTC CCAAGTGAGTGAGAACATCTTTCAGACCCAGCCACTTAACC CTATCCTC 
GAGGACGACTGCAGCTCGACTAGTGGCTTCCCCACACAGGAGTCCTTTACCATGGCCTCC 
TGCACCTCAGAGAGCCACTCCCACTGGGTCCACAGCCCCATCGAATGCACAGAGCTGGAC 
CTGCAAAAGTTTTCCAGCTCTGCCTCCTATACTGGAGCTGAGACCTTGGGGGGAAACACA 
GTCGAAAGCACTGGAGACAGGCTGGAGCTCAATGTGCCCTTTGAAGTTCCCAGCCC TTAA 
GC 
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Figure 128 

MDCQENEYWDQWGRCVTCQRCGPGQELSKDCGYGEGGDAYCTACPPRRYKSSWGHHRCQ 
SCITC^VINRVQKWCTATSNAVCGDCLPRFYRKTRIGGLQDQECIPCTKQTPTSEVQC 
AFQLSLVEADAPTVP PQEATLVALVS S LLVVFTLAFLGLFFLYCKQ F FNRHCQRVTGGL 
LQFEADKTAKEESLFPVPPSKETS AESQVS ENI FQTQPLNP I LEDDCSSTSGFPTQES F 
TMASCTSESHSHWVHSPIECTELDLQKFSSSASYTGAETLGGNTVESTGDRLELNVPFE 
VPSP 
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